TITLE OF THE INVENTION: Secreted soluble α2δ-2, α2δ-3 or α2δ-4 calcium
channel subunit polypeptides and screening assays using same
Background of the invention
Voltage-dependent Ca2+ channels (VDCCs) are heteromultimeric complexes present in both
neuronal and non-neuronal tissues, including heart and skeletal muscle. VDCCs are
minimally composed of three subunits: a pore-forming transmembrane α1 subunit, a
hydrophilic intracellular β subunit, and a membrane-associated α2δ subunit; a
transmembrane γ subunit is also found in skeletal muscle tissue. Multiple subtypes and/or
splice variants of the α1, β, and α2δ subunits have been found.



Gabapentin (1-(aminomethyl)cyclohexane acetic acid or Neurontin) is a structural analogue
of GABA, which is mainly used as an adjunctive therapy for epilepsy. Recent research
suggests that gabapentin may also have clinical utility for various indications including
anxiety and pain. Although designed as a lipophilic GABA-mimetic, gabapentin does not
have a high affinity for either GABA-A or GABA-B receptors, GABA uptake sites, or the
GABA-degrading enzyme GABA-transaminase (EC 2.6.1.19).

A novel high affinity binding site for [3H]gabapentin in rat, mouse, and porcine brains has
been characterized. Recently, the [3H]gabapentin-binding protein was isolated from pig
brain and identified as the α2δ-1 subunit of VDCCs. None of the prototypic anticonvulsant
drugs displace [3H]gabapentin binding from the α2δ-1 subunit. [3H]Gabapentin binding is
stereospecifically inhibited by two enantiomers of 3-isobutyl GABA. The rank order of
potency of gabapentin, and S- and R-isobutyl GABA, at the [3H]gabapentin binding site
mirrors their anticonvulsant activity in mice. However, electrophysiological studies have
yielded conflicting data on the action of gabapentin at VDCCs.

The α2δ subunit is derived from a single gene, the product of which is extensively
post-translationally modified, particularly through the cleavage of the signal sequence. The
polypeptide is cleaved to form disulfide-bridged α2 and δ peptides, both of which are
heavily glycosylated. Although it seems clear today that the α2 and δ peptides are
membrane-associated peptides, it is unclear whether these peptides comprise one or several
transmembrane domains. Furthermore, the location, size and structural configuration of
these eventual transmembrane domains remain to be determined.



But in any event, the fact that α2δ is a membrane-associated protein, regardless of its
precise structural configuration, renders its large scale expression in recombinant systems
difficult. Indeed, as the α2δ protein is targeted to the membrane, it requires detergent
solubilisation to release it for purification. This important drawback imposes considerable
restrictions for any potential applications requiring large amounts of recombinant protein.
Furthermore, the various subtypes of α2δ subunits are different proteins with very low
homologies. It is therefore extremely difficult to predict their respective behaviors, for
example in gene truncation experiments.

The only assay currently available for the screening of ligands that bind the α2δ subunit
involves the use of pig membrane extracts as a source of the α2δ subunit. Such an assay
presents major drawbacks. Firstly, because the assay material is a membrane extract, it is
very difficult to accurately determine the protein composition from one assay preparation
to another, particularly with regard to the subtype. Also, the presence of various impurities
in the assay preparation is a problem in small plate assays. Furthermore, as the protein
preparation lacks homogeneity, the interaction between the targeted protein and the assay
plate is often quite uneven. This renders the streamlining of the assay in a high throughput
format almost impossible to achieve.

Summary of the invention 

In the context of the present invention, the inventors have found that it was possible to
delete a portion of the nucleotide sequence encoding a eukaryotic, preferably a mammal
cerebral cortical voltage-dependent calcium channel α2δ subunit to yield a soluble secreted
protein which retains its affinity for [3H]gabapentin.

The invention concerns: 

1) A purified or isolated nucleic acid encoding a mammalian secreted soluble cerebral
cortical voltage-dependent calcium channel α2δ-2, α2δ-3 or α2δ-4 subunit polypeptide.

2) A purified or isolated nucleic acid according to 1), comprising a polynucleotide having
at least 90% identity with the sequence encoding:

- from amino-acid 1 to between amino-acids 1027 and 1062 of SEQ ID N°20 for α2δ-2,

- from amino-acid 1 to between amino-acids 984 and 1019 of SEQ ID N°22 for α2δ-3.

3) A purified or isolated nucleic acid according to 1), having at least 90% identity with the
sequence encoding:

- from amino-acid 1 to between amino-acids 1047 and 1062 of SEQ ID N°20 for α2δ-2,

- from amino-acid 1 to between amino-acids 1004 and 1019 of SEQ ID N°22 for α2δ-3.



4) A purified or isolated nucleotide sequence according to 1) wherein said sequence is the
sequence of SEQ ID N°1, SEQ ID N°2, SEQ ID N°3, SEQ ID N°7, SEQ ID N°8, SEQ ID
N°9, SEQ ID N°13, SEQ ID N°14, SEQ ID N°15, SEQ ID N°19 or SEQ ID N°21.

5) A purified or isolated nucleic acid, having at least 90% identity with the nucleotide
sequence of SEQ ID N°19 or SEQ ID N°21.

6) A purified or isolated polynucleotide comprising at least 10 consecutive nucleotides of
the nucleotide sequence of SEQ ID N°19 or SEQ ID N°21.





7) A polynucleotide probe or primer hybridizing, under stringent conditions, with the
nucleotide sequence of SEQ ID N°19 or SEQ ID N°21.

8) A method for the amplification of a nucleic acid encoding a mammalian secreted
soluble cerebral cortical voltage-dependent calcium channel α2δ-n subunit polypeptide
wherein n is 2, 3 or 4, said method comprising the steps of:

(a) contacting a test sample suspected of containing the target secreted soluble α2δ-n
subunit nucleic acid, or a sequence complementary thereto, with an amplification
reaction reagent comprising a pair of amplification primers located on either side of
the α2δ-n subunit nucleic acid region to be amplified, and

(b) optionally, detecting the amplification products. 

9) A kit for the amplification of a nucleic acid encoding a secreted soluble α2δ-n subunit
polypeptide wherein n is 2, 3 or 4, or a complementary sequence thereto in a test sample,
wherein said kit comprises:

(a) a pair of oligonucleotide primers which can hybridize, under stringent
conditions, to the secreted soluble α2δ-n subunit nucleic acid region to be
amplified;

(b) optionally, the reagents necessary for performing the amplification reaction.

10) A recombinant vector comprising a nucleic acid according to 1). 

11) A recombinant host cell comprising a nucleic acid according to 1).

12) A method for producing a secreted soluble α2δ-n subunit wherein n is 2, 3 or 4, and
said method comprises the steps of:

(a) inserting the nucleic acid encoding the desired α2δ-n subunit polypeptide in an
appropriate vector;

(b) culturing, in an appropriate culture medium, a host cell previously transformed
or transfected with the recombinant vector of step (a);

(c) harvesting the culture medium thus obtained or lysing the host cell, for example
by sonication or osmotic shock;

(d) separating or purifying, from said culture medium, or from the pellet of the
resultant host cell lysate, the thus produced α2δ-n subunit polypeptide of interest.

13) A purified or isolated recombinant polypeptide comprising the amino acid sequence of
a secreted soluble α2δ-2, α2δ-3 or α2δ-4 subunit polypeptide.

14) A recombinant polypeptide according to 13), having at least 80% amino-acid identity
with a polypeptide comprising:

- from amino acid 1 to between amino acids 1027 and 1062 of the amino acid sequence
of SEQ ID N°20, or

- from amino acid 1 to between amino acids 1019 and 1079 of the amino acid sequence
of SEQ ID N°22.

15) A recombinant polypeptide according to 14), wherein said recombinant polypeptide is
selected from the group consisting of the amino acid sequences of SEQ ID N°4, SEQ ID
N°5, SEQ ID N°6, SEQ ID N°10, SEQ ID N°11, SEQ ID N°12, SEQ ID N°16, SEQ ID N°17
and SEQ ID N°18.



16) A method for the screening of ligands which bind a cerebral cortical
voltage-dependent calcium channel α2δ-n subunit wherein n is 2, 3 or 4, said method comprising
the steps of:

- contacting a secreted soluble recombinant calcium channel α2δ-n subunit
polypeptide with:

- a ligand of interest; and

- a labelled compound which binds the α2δ-n subunit; and

- measuring the level of binding of the labelled compound to the α2δ-n subunit.

17) A method according to 16), wherein said method is a scintillation proximity assay. 

18) A method according to 16), wherein said method is a flashplate assay. 

19) A method according to 16), wherein said method is a filter binding assay. 

20) A method according to 16), wherein said secreted soluble recombinant calcium
channel α2δ-n subunit polypeptide is selected from polypeptides having at least 80%,
preferably 90%, more preferably 95%, and most preferably 98 or 99% amino-acid identity
with the polypeptide comprising from amino acid 1 to between amino-acids 984 and 1063,
preferably between amino-acids 994 and 1054, and most preferably between amino-acids
1019 and 1054 of SEQ ID N°5 or SEQ ID N°16.

21) A method according to 16), wherein said secreted soluble recombinant calcium
channel α2δ-n subunit polypeptide is selected from the group consisting of SEQ ID N°6, 7,
8, 9, 13, 14 and 15, with the polypeptides of SEQ ID N°9 and SEQ ID N°15 being most
preferred.

22) A kit for the screening of ligands which bind a cerebral cortical
voltage-dependent calcium channel α2δ-n subunit wherein n is 2, 3 or 4, said kit comprising:

- a secreted soluble recombinant calcium channel α2δ-n subunit; and

- a labelled compound which binds to the α2δ-n subunit.

Hence, the invention concerns nucleotide sequence fragments of a cerebral cortical
voltage-dependent calcium channel α2δ-2, α2δ-3 or α2δ-4 subunit cDNA encoding a soluble
secreted α2δ-2, α2δ-3 or α2δ-4 subunit polypeptide (hereinafter an α2δ-2, α2δ-3 or α2δ-4
subunit). Preferably, these nucleotide sequences encode a soluble secreted α2δ-2, α2δ-3 or
α2δ-4 subunit polypeptide bearing a gabapentin or a [3H]gabapentin binding site. More
preferably, the soluble secreted α2δ-2, α2δ-3 or α2δ-4 subunit nucleic acid is derived from
a eukaryotic, preferably a mammal, more preferably a human α2δ-2, α2δ-3 or α2δ-4
subunit.

A further object of the present invention concerns recombinant vectors comprising a
nucleic acid sequence encoding a soluble secreted α2δ-2, α2δ-3 or α2δ-4 subunit
polypeptide.

The invention also encompasses host cells and transgenic non-human mammals
comprising said nucleic acid sequences or recombinant vectors.

The invention also concerns a soluble secreted α2δ-2, α2δ-3 or α2δ-4 subunit polypeptide
which is characterized in that it is a soluble secreted polypeptide having affinity for
[3H]gabapentin. Preferably, the soluble secreted polypeptide is derived from a mammal,
more preferably a human α2δ-2, α2δ-3 or α2δ-4 subunit.

The inventors have also found that it was possible to use a soluble secreted form of a
voltage-dependent calcium channel α2δ-2, α2δ-3 or α2δ-4 subunit polypeptide in an assay
for the screening of ligands which bind the α2δ-2, α2δ-3 or α2δ-4 subunit.

The invention therefore also concerns a method for the screening of ligands which bind a
calcium channel α2δ-2, α2δ-3 or α2δ-4 subunit.
The method comprises the steps of:

- contacting a secreted soluble recombinant calcium channel α2δ-2, α2δ-3 or α2δ-4
subunit polypeptide with:

- a ligand of interest; and

- a labelled compound which binds an α2δ-2, α2δ-3 or α2δ-4 subunit; and

- measuring the level of binding of the labelled compound to the secreted soluble
α2δ-2, α2δ-3 or α2δ-4 subunit.
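
By way of illustration only, the measurement step of such a competition assay is commonly reduced to a percent inhibition of specific binding. The following minimal Python sketch assumes raw counts from three wells (total binding, non-specific binding, and binding in the presence of the ligand of interest). The function names and the count values are hypothetical and are not taken from the specification.

```python
def specific_binding(total_counts: float, nonspecific_counts: float) -> float:
    """Specific binding of the labelled compound = total minus non-specific counts."""
    return total_counts - nonspecific_counts


def percent_inhibition(counts_with_ligand: float,
                       total_counts: float,
                       nonspecific_counts: float) -> float:
    """Percentage of specific labelled-compound binding displaced by the test ligand."""
    window = specific_binding(total_counts, nonspecific_counts)
    if window <= 0:
        raise ValueError("total binding must exceed non-specific binding")
    return 100.0 * (total_counts - counts_with_ligand) / window


# Hypothetical counts, for illustration only.
inhibition = percent_inhibition(counts_with_ligand=2750.0,
                                total_counts=5000.0,
                                nonspecific_counts=500.0)
print(f"{inhibition:.1f}% inhibition of specific binding")
```

A ligand that does not bind the subunit would give a value near 0%, and a ligand that fully displaces the labelled compound a value near 100%.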

The invention also concerns a kit for the screening of ligands which bind a calcium
channel α2δ-2, α2δ-3 or α2δ-4 subunit.
The kit comprises:

- a secreted soluble recombinant calcium channel α2δ-2, α2δ-3 or α2δ-4
subunit polypeptide; and

- a labelled compound which binds a calcium channel α2δ-2, α2δ-3 or
α2δ-4 subunit.

Detailed description of the invention

The invention concerns truncated α2δ-2, α2δ-3 or α2δ-4 subunit cDNA sequences. These
truncated sequences encode soluble secreted polypeptides which retain their affinity for
[3H]gabapentin.

Throughout the present specification, the expression "nucleotide sequence" is used to
designate either a polynucleotide or a nucleic acid. More precisely, the expression
"nucleotide sequence" encompasses the nucleic material and the sequence information and
is not restricted to the sequence information (i.e. the succession of letters chosen among the
four base letters) that biochemically characterizes a specific DNA or RNA molecule.

As used interchangeably herein, the terms "oligonucleotides", "nucleic acids" and 
"polynucleotides" include RNA, DNA, or RNA/DNA hybrid sequences of more than one 
nucleotide in either single chain or duplex form. 

Further to its general meaning understood by the one skilled in the art, the term
"nucleotide" is also used herein to encompass modified nucleotides which comprise at
least one of the following modifications: (a) an alternative linking group, (b) an analogous
form of purine, (c) an analogous form of pyrimidine, or (d) an analogous sugar. For
examples of analogous linking groups, purines, pyrimidines, and sugars, see for example
PCT publication N°WO 95/04064.
The polynucleotide sequences of the invention may be prepared by any known method,
including synthetic, recombinant, or a combination thereof as well as through any
purification methods known in the art.



A) Secreted α2δ-2, α2δ-3 or α2δ-4 subunit polypeptides

The invention comprises polynucleotide sequences encoding a soluble secreted eukaryotic,
preferably a soluble secreted mammal α2δ-2, α2δ-3 or α2δ-4 subunit polypeptide. These
sequences particularly include but are not restricted to 1) those sequences encoding a
soluble secreted polypeptide of this α2δ-2, α2δ-3 or α2δ-4 subunit which preferably retains
its binding affinity for [3H]gabapentin and 2) nucleotide fragments useful as nucleic acid
primers or probes for amplification or detection purposes.

The expression "soluble secreted α2δ-2, α2δ-3 or α2δ-4 subunit" is intended to designate
polypeptide sequences which, when produced by a recombinant host cell, are secreted at
least partially into the culture medium rather than remaining associated with the host cell
membrane.

1) cDNA fragments encoding soluble secreted α2δ-2, α2δ-3, α2δ-4 subunit
polypeptides

One of the important embodiments of the present invention concerns truncated nucleotide
sequences of α2δ-2, α2δ-3 or α2δ-4 subunit cDNAs which encode soluble secreted α2δ-2,
α2δ-3 or α2δ-4 subunit polypeptides. The inventors have found that it was possible to
generate deletion mutants of α2δ-2, α2δ-3 or α2δ-4 subunit cDNAs which, when expressed,
produce a significant amount of soluble secreted proteins, preferably soluble secreted
proteins which retain their [3H]gabapentin binding affinity. These truncated nucleotide
sequences of the invention are of significant value to the skilled person as they now allow
fast and reliable access to significant concentrations of selected soluble secreted α2δ-2,
α2δ-3 or α2δ-4 subunit polypeptides. To that end, the inventors have determined the
minimal and optimal fragment lengths required to express a polypeptide which: 1) binds
[3H]gabapentin with sufficient affinity; and 2) is obtained in a soluble secreted form.
The discussion below comments on possible truncations, giving as an
example the human α2δ-2, α2δ-3 or α2δ-4 subunit. However, given the very substantial
cross-species homology for α2δ-2, α2δ-3 or α2δ-4 subunit sequences, the comments below
can also be applied to other eukaryotic species, and more particularly other mammalian
species such as rat, mouse, rabbit or pig. Their α2δ-2, α2δ-3 or α2δ-4 subunit sequences,
most of which are available in public databases, share a very substantial homology with
the human α2δ-2, α2δ-3 or α2δ-4 subunit sequences.

The inventors believe that the soluble secreted α2δ-2 subunit polypeptides which are as
close as possible to the native sequence and which are therefore more likely to retain their
native folding and hence their [3H]gabapentin binding properties are those corresponding
to the native protein in which the amino-acid stretch from 1027 to the C-terminal end of
the amino-acid sequence of SEQ ID N°20 has been deleted. The skilled scientist can quite
easily determine within this amino-acid stretch the optimal α2δ-2 subunit polypeptides.

The inventors also believe that the soluble secreted α2δ-3 subunit polypeptides which are
as close as possible to the native sequence and which are therefore more likely to retain
their native folding and hence their [3H]gabapentin binding properties are those
corresponding to the native protein in which the amino-acid stretch from 984 to the C-terminal
end of the amino-acid sequence of SEQ ID N°22 has been deleted. The skilled scientist
can quite easily determine within this amino-acid stretch the optimal α2δ-3 subunit
polypeptides.
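
For illustration, the search for an optimal C-terminal truncation point within a given amino-acid stretch can be sketched as an enumeration of candidate polypeptides, each running from amino acid 1 to a chosen final residue. The Python sketch below is a minimal example under stated assumptions: the helper names are hypothetical, residues are numbered from 1, the final residue is retained, and the short placeholder sequence is not taken from SEQ ID N°20 or SEQ ID N°22.

```python
def truncate_from_c_terminus(sequence: str, last_residue: int) -> str:
    """Keep residues 1..last_residue (1-based, inclusive) and drop the C-terminal rest."""
    if not 1 <= last_residue <= len(sequence):
        raise ValueError("last_residue must lie within the sequence")
    return sequence[:last_residue]


def candidate_truncations(sequence: str, first_end: int, last_end: int) -> dict[int, str]:
    """Map each candidate final residue in [first_end, last_end] to its truncated polypeptide."""
    return {end: truncate_from_c_terminus(sequence, end)
            for end in range(first_end, last_end + 1)}


# Hypothetical 12-residue placeholder sequence, for illustration only.
toy_sequence = "MAGPLRSTVWKE"
variants = candidate_truncations(toy_sequence, 8, 10)
for end, polypeptide in variants.items():
    print(end, polypeptide)
```

Each candidate would then be expressed and tested for secretion and for [3H]gabapentin binding, which the sketch does not attempt to model.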

The invention therefore particularly concerns a nucleotide sequence encoding a
polypeptide having at least 80% identity with the polypeptide comprising from amino-acid
1 to between amino-acids 1027 and 1145, preferably to between amino-acids
1062 and 1145 of SEQ ID N°20.

Preferred nucleotide sequences include those of SEQ ID N°1, SEQ ID N°2 and SEQ ID
N°3.

The invention also concerns a nucleotide sequence encoding a polypeptide having at least
80% identity with the polypeptide comprising from amino-acid 1 to between amino-acids
984 and 1085, preferably to between amino-acids 1019 and 1085 of SEQ ID
N°22.

Preferred nucleotide sequences include those of SEQ ID N°7, SEQ ID N°8 and SEQ ID
N°9.
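
As a purely illustrative aid, an identity threshold such as "at least 80% identity" can be checked on two sequences that have already been aligned. The Python sketch below assumes one common convention (identical positions divided by aligned positions, with gap characters written as "-"); the specification may rely on a different definition or alignment program, and the sequences shown are hypothetical placeholders.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over aligned columns; columns gapped in both sequences are ignored."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == "-" and b == "-":
            continue  # column carries no information
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


# Hypothetical pre-aligned placeholder sequences, for illustration only.
identity = percent_identity("MAGPLRSTVW", "MAGPLKSTVW")
print(f"{identity:.1f}% identity; meets 80% threshold: {identity >= 80.0}")
```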



B) Amplification of the soluble secreted α2δ-2, α2δ-3 or α2δ-4 subunit nucleotide
sequences

Another object of the invention consists of a method for the amplification of a nucleic acid
encoding a soluble secreted α2δ-2, α2δ-3 or α2δ-4 subunit polypeptide, preferably a
polypeptide bearing a [3H]gabapentin binding site, said method comprising the steps of:

(a) contacting a test sample suspected of containing the target α2δ-2, α2δ-3 or α2δ-4
subunit nucleic acid, a fragment or a variant thereof, or a sequence complementary thereto,
with an amplification reaction reagent comprising a pair of amplification primers which
can hybridize, under stringent conditions, to the α2δ-2, α2δ-3 or α2δ-4 subunit nucleic acid
region to be amplified, and

(b) optionally, detecting the amplification products.

The expression "[3H]gabapentin binding site", when used herein, is intended to designate a
site which can bind either [3H]gabapentin or other ligands such as (S+)-3-isobutyl GABA or
(R-)-3-isobutyl GABA.

In a first preferred embodiment of the above method, the nucleic acid encodes a secreted
soluble α2δ-2, α2δ-3 or α2δ-4 subunit polypeptide of SEQ ID N°4, SEQ ID N°5, SEQ ID
N°6, SEQ ID N°10, SEQ ID N°11, SEQ ID N°12, SEQ ID N°16, SEQ ID N°17 or SEQ
ID N°18.

In a second preferred embodiment of the above amplification method, the amplification
product is detected by hybridization with a labelled probe having a sequence which is 
complementary to the amplified region. 

C) Recombinant vectors and host cells for the expression of a secreted soluble α2δ-2,
α2δ-3 or α2δ-4 subunit polypeptide

1) Recombinant vectors 

The present invention also encompasses a family of recombinant vectors comprising any
one of the nucleic acids described herein. Firstly, the invention deals with a recombinant
vector comprising a nucleic acid selected from the group consisting of:

(a) a purified or isolated nucleic acid encoding a secreted soluble α2δ-2, α2δ-3 or
α2δ-4 subunit having at least 80% amino acid identity with the polypeptide of SEQ ID
N°20 or 22, or a sequence complementary thereto;

(b) a purified or isolated nucleic acid having at least 90% nucleotide identity with a
polynucleotide selected from the group consisting of the nucleotide sequences of SEQ ID
N°1, SEQ ID N°2, SEQ ID N°3, SEQ ID N°7, SEQ ID N°8, SEQ ID N°9, SEQ ID N°13,
SEQ ID N°14, SEQ ID N°15 or a sequence complementary thereto;

(c) a purified or isolated polynucleotide comprising at least 10 consecutive
nucleotides of a nucleic acid described in (a) or (b) or a sequence complementary thereto.

In a first preferred embodiment a recombinant vector of the invention is used to amplify
the inserted polynucleotide of the invention in a suitable host cell, this polynucleotide 
being amplified every time the recombinant vector replicates. 

Recombinant expression vectors comprising a nucleic acid encoding secreted soluble α2δ-2,
α2δ-3 or α2δ-4 subunit polypeptides that are described in the present specification are
also part of the invention. These include, but are not restricted to, nucleic acids encoding
from amino-acid 1 to between amino-acids 1027 and 1145, preferably between
amino-acids 1062 and 1145 of SEQ ID N°20, as well as nucleic acids encoding from amino-acid
1 to between amino-acids 984 and 1085, preferably between amino-acids 1019 and 1085, of
SEQ ID N°22.

Another preferred embodiment of the recombinant vectors according to the invention
consists of expression vectors comprising a nucleic acid encoding α2δ-2, α2δ-3 or α2δ-4
subunit polypeptides of the invention, and more preferably a nucleic acid encoding a
polypeptide selected from the group consisting of the amino acid sequences of SEQ ID
N°4, SEQ ID N°5, SEQ ID N°6, SEQ ID N°10, SEQ ID N°11, SEQ ID N°12, SEQ ID
N°16, SEQ ID N°17 and SEQ ID N°18.

Within certain embodiments, expression vectors can be employed to express the secreted
soluble α2δ-2, α2δ-3 or α2δ-4 subunit polypeptides which can then be purified and, for
example, be used as an immunogen in order to raise specific antibodies directed against said
secreted soluble α2δ-2, α2δ-3 or α2δ-4 subunit polypeptides.
Preferred eukaryotic vectors of the invention are listed hereafter as illustrative but not
limitative examples: pcDNA3, pFLAG, pCMV-Script, pIND, pMC1NEO, pHIL,
pGAPZA, pMT/V5-His-TOPO, pMT/V5-His, pAc5.1/V5-HisA, pDS47/V5-His, pcDNA4,
pcDNA6, pEF1, pEF4, pEF6, pUB6, pZeoSV2, pRc/CMV2, pcDM8, pCR3.1, pDisplay,
pSecTag2, pVP22, pEMZ, pCMV/Zeo, pSinRep5, pCEP, pREP, pHook-1.

Preferred bacteriophage recombinant vectors of the invention are P1 bacteriophage vectors
such as described by Sternberg N.L. (1992; 1994).

A suitable vector for the expression of a soluble secreted α2δ-2, α2δ-3 or α2δ-4 subunit
polypeptide is a baculovirus vector that can be propagated in insect cells and in insect
cell-lines. Specific suitable host vectors include, but are not restricted to: pFastBac-1, pIZ/V5-His,
pBacMan-1, pBlueBac4.5, pBlueBacHis2, pMelBacA, pVL1392, pVL1393.

The recombinant expression vectors of the invention may also be derived from an
adenovirus such as those described by Feldman and Steig (1996) or Ohno et al. (1994).
Another preferred recombinant adenovirus according to this specific embodiment of the
present invention is the human adenovirus type two or five (Ad 2 or Ad 5) or an
adenovirus of animal origin (French Patent Application n°FR 93 05 954).

a) Regulatory expression sequences 

Expression requires that appropriate signals are provided in the vectors, said signals
including various regulatory elements such as enhancers/promoters from both viral and
mammalian sources that drive expression of the genes of interest in host cells. The
regulatory sequences of the expression vectors of the invention are operably linked to the
nucleic acid encoding a soluble secreted α2δ-2, α2δ-3 or α2δ-4 subunit polypeptide.

As used herein, the term "operably linked" refers to a linkage of polynucleotide elements
in a functional relationship. For instance, a promoter or an enhancer is operably linked to a
coding sequence if it affects the transcription of the coding sequence.
More precisely, two DNA molecules (such as a polynucleotide containing a promoter
region and a polynucleotide encoding a desired polypeptide or polynucleotide) are said to
be "operably linked" if the nature of the linkage between the two polynucleotides does
not: (1) result in the introduction of a frame-shift mutation or (2) interfere with the ability
of the polynucleotide containing the promoter to direct the transcription of the coding
polynucleotide.

Generally, recombinant expression vectors include origins of replication, selectable
markers permitting transformation of the host cell, and a promoter derived from a highly
expressed gene to direct transcription of a downstream structural sequence. The
heterologous structural sequence is assembled in an appropriate frame with the translation
initiation and termination sequences, and preferably a leader sequence capable of directing
secretion of the translated protein into the periplasmic space or the extra-cellular medium.

In a specific embodiment wherein the vector is adapted for transfecting and expressing
desired sequences in eukaryotic host cells, preferred vectors comprise an origin of
replication from the desired host, a suitable promoter and an enhancer, and also any
necessary ribosome binding sites, polyadenylation site, transcriptional termination
sequences, and optionally 5'-flanking non-transcribed sequences.

DNA sequences derived from the SV40 viral genome, for example SV40 origin, early
promoter, enhancer, and polyadenylation sites may be used to provide the required
non-transcribed genetic elements.

b) Promoter sequences 

Suitable promoter regions used in the expression vectors according to the invention are
chosen taking into account the host cell in which the heterologous nucleic acids have to be
expressed.

A suitable promoter may be heterologous with respect to the nucleic acid for which it
controls the expression, or alternatively can be endogenous to the native polynucleotide
containing the coding sequence to be expressed.

Additionally, the promoter is generally heterologous with respect to the recombinant vector
sequences within which the construct promoter/coding sequence has been inserted.
Preferred eukaryotic promoters are the CMV, polyhedrin or OpIE2 promoters.

c) Recombinant host cells

Host cells that have been transformed or transfected with one of the nucleic acids described
herein, or with one of the recombinant vectors, particularly a recombinant expression vector,
described herein are also part of the present invention.

Included are host cells that are transformed (prokaryotic cells) or transfected
(eukaryotic cells) with a recombinant vector such as one of those described above.
Preferred host cells used as recipients for the expression vectors of the invention are the
following:

(a) prokaryotic host cells: Escherichia coli strains (e.g. DH10Bac strain), Bacillus
subtilis, Salmonella typhimurium and strains from species such as Pseudomonas,
Streptomyces and Staphylococcus;

(b) eukaryotic host cells: HeLa cells (ATCC N°CCL2; N°CCL2.1; N°CCL2.2), CV-1
cells (ATCC N°CCL70), COS cells (ATCC N°CRL 1650; N°CRL 1651), Sf-9 cells
(ATCC N°CRL 1711), C127 cells (ATCC N°CRL-1804), 3T3 cells (ATCC N°CRL-6361),
CHO cells (ATCC N°CCL-61), human kidney 293 cells (ATCC N° 45504; N°CRL-1573),
BHK (ECACC N°84100501; N°84111301), Sf9, Sf21 and Hi-5 cells.

D) Production of recombinant secreted soluble α2δ-2, α2δ-3 or α2δ-4 subunit
polypeptides

The present invention also concerns a method for producing one of the amino acid
sequences described herein and especially a polypeptide selected from the group consisting
of the amino acid sequences of SEQ ID N°4, SEQ ID N°5, SEQ ID N°6, SEQ ID N°10,
SEQ ID N°11, SEQ ID N°12, SEQ ID N°16, SEQ ID N°17 or SEQ ID N°18 wherein said
method comprises the steps of:

(a) inserting the nucleic acid encoding the desired amino acid sequence in an
appropriate vector;

(b) culturing, in an appropriate culture medium, a host cell previously transformed
or transfected with the recombinant vector of step (a);

(c) harvesting the culture medium thus obtained or lysing the host cell, for example
by sonication or osmotic shock;

(d) separating or purifying, from said culture medium, or from the pellet of the
resultant host cell lysate, the thus produced recombinant polypeptide of interest.

In some instances, it is required to tag the secreted soluble α2δ-2, α2δ-3 or α2δ-4 subunit
polypeptide prior to purification. The tag is then in most instances encoded into the
nucleotide sequence that is needed to express the polypeptide. Examples of such tags include,
but are not limited to, sequences encoding c-myc, FLAG, a sequence of histidine residues,
haemagglutinin A, V5, Xpress or GST. Most of these tags can be incorporated directly into
the sequence, for instance through PCR amplification by incorporating the appropriate
coding sequence in one of the PCR amplification primers. However, the tag can also be
introduced by other means such as covalent binding of the appropriate nucleic acid
sequence encoding the tag moiety with the 3' or 5' end of the nucleic acid sequence
encoding the polypeptide sequence. This is the case for GST.
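
To illustrate how a tag can be carried by a PCR amplification primer, the following minimal Python sketch builds a reverse primer that appends histidine codons and a stop codon to the 3' end of a coding sequence. It is a sketch under stated assumptions only: the function names are hypothetical, the nine-nucleotide coding stretch is a placeholder rather than part of any SEQ ID of the specification, CAT is used as the histidine codon and TAA as the stop codon, and practical primers would usually also carry a restriction site and a longer annealing region.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def his_tag_reverse_primer(coding_3prime_end: str, n_his: int = 6) -> str:
    """Reverse primer adding n_his histidine codons and a stop codon in frame.

    coding_3prime_end is the last in-frame stretch of the coding strand,
    without its own stop codon.
    """
    if not 2 <= n_his <= 10:
        raise ValueError("n_his is expected to be between 2 and 10")
    if len(coding_3prime_end) % 3 != 0:
        raise ValueError("coding stretch must be a whole number of codons")
    tagged_end = coding_3prime_end.upper() + "CAT" * n_his + "TAA"
    return reverse_complement(tagged_end)


# Hypothetical placeholder coding stretch (Ala-Gly-Pro), for illustration only.
primer = his_tag_reverse_primer("GCTGGTCCA")
print(primer)
```

Reading the reverse complement of the resulting primer gives back the coding stretch followed by six histidine codons and the stop codon, which is the arrangement amplified into the product.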

Purification of the recombinant secreted soluble α2δ-2, α2δ-3 or α2δ-4 subunit polypeptides
according to the present invention is then carried out by passage onto a nickel or copper
affinity chromatography column, such as a Ni-NTA column or a Q-Sepharose column.




In another embodiment of the above method, the polypeptide thus produced is further
characterized, for example by binding onto an immuno-affinity chromatography column on
which polyclonal or monoclonal antibodies directed to the secreted soluble α2δ-2 subunit
polypeptide of interest have been previously immobilised.

In another embodiment of the invention, the secreted soluble α2δ-2, α2δ-3 or α2δ-4 subunit
polypeptide can be only partially purified. For instance, it can be purified along with other
contaminating proteins using an appropriate chromatography matrix such as an
ion-exchange chromatography column. In such instances, it is not required to tag the
polypeptide of interest.

The most preferred embodiment contemplated by the inventors concerns the use of a
purified tagged secreted soluble α2δ-2, α2δ-3 or α2δ-4 subunit polypeptide. A particularly
preferred tag is a nucleotide sequence encoding from 2 to 10, and preferably 6 histidine
residues.

With regard to the secreted soluble α2δ-2, α2δ-3 or α2δ-4 subunit polypeptide used
subsequently in the screening assay of the invention, several possibilities are also open to
the skilled person.

In a first and preferred embodiment, the secreted soluble α2δ-2, α2δ-3 or α2δ-4 subunit
polypeptide comprises a tag moiety which can be selected among the tags referred to
above. Such tagged polypeptides are particularly useful in SPA or flashplate assays. A
preferred tag is the nucleotide sequence encoding histidine residues referred to above.

In a second embodiment, the secreted soluble α2δ-2, α2δ-3 or α2δ-4 subunit polypeptide
can be used without a tag. This is the case for instance in SPA or flashplate assays
comprising beads or plates coated with wheat germ lectin. In such an embodiment, the tag
is not needed as the carbohydrate moieties of the secreted soluble α2δ-2, α2δ-3 or α2δ-4
subunit polypeptide bind directly to the wheat germ lectin-coated beads or plates.
1) Labelled compounds which bind the secreted soluble α2δ-2, α2δ-3 or α2δ-4
subunit polypeptide

In cases where the α2δ-2, α2δ-3 or α2δ-4 binding site is the [3H]gabapentin binding site,
the preferred labelled compound which can be used is of course gabapentin itself.
However, gabapentin is not the only labelled compound which can be used in this context.
Indeed, it has been previously demonstrated that saturation binding analyses on porcine
cerebral cortex synaptic plasma membranes performed in the presence of L-leucine
indicate a competitive interaction of the amino acid with the [3H]gabapentin binding site,
significantly reducing [3H]gabapentin binding affinity for the site. The inventors believe
that this competitive interaction holds across all the amino acids listed in Table 1
below.



Table 1

Binding affinities of selected amino acids (IC50 < 500 nM) for the [3H]gabapentin site in
porcine cortical membranes

Compound              IC50 (nM), arithmetic mean (n=3) ± S.E.M.

Gabapentin            42.1 ± 5.5
L-Norleucine          23.6 ± 6.7
L-Allo-Isoleucine     32.8 ± 6.0
L-Methionine          49.6 ± 10.0
L-Leucine             61.3 ± 20.9
L-Isoleucine          68.8 ± 1.9
L-Valine              330 ± 18
L-Phenylalanine       351 ± 89

It is therefore possible to use commercially available labelled forms of these high affinity
ligands in place of gabapentin. The utility of [3H]L-leucine has been demonstrated
in a filter binding assay and in a flashplate assay format. The inventors believe that labelled
amino acids, but also other compounds, with affinities preferably below 500 nM in the
binding assay can be used as replacements for gabapentin.
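
The selection criterion stated above (affinity below 500 nM in the binding assay) can be illustrated with a short sketch. The IC50 values are those of Table 1; the function name and the cut-off parameter are illustrative assumptions and are not part of the specification.

```python
# IC50 values (nM) transcribed from Table 1 (arithmetic mean, n=3).
TABLE_1_IC50_NM = {
    "Gabapentin": 42.1,
    "L-Norleucine": 23.6,
    "L-Allo-Isoleucine": 32.8,
    "L-Methionine": 49.6,
    "L-Leucine": 61.3,
    "L-Isoleucine": 68.8,
    "L-Valine": 330.0,
    "L-Phenylalanine": 351.0,
}


def candidate_radioligands(ic50_nm, cutoff_nm=500.0):
    """Return (compound, IC50) pairs below the cut-off, most potent first.

    Hypothetical helper: the 500 nM default mirrors the preferred
    affinity limit mentioned in the text.
    """
    hits = [(name, value) for name, value in ic50_nm.items() if value < cutoff_nm]
    return sorted(hits, key=lambda item: item[1])


ranked = candidate_radioligands(TABLE_1_IC50_NM)
for name, value in ranked:
    print(f"{name}: {value} nM")
```

With the Table 1 values, every listed compound passes the 500 nM cut-off; lowering `cutoff_nm` to 100 would retain only the six most potent ligands.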



With regard to the label, several embodiments can be used in the context of the invention.
Preferred labels are of course radioactive labels, a list of which is provided further in this
specification.

2) Assay formats and conditions 

Several assay formats can be used to carry out the method of the present invention. 

Preferred assay formats include scintillation assays such as the scintillation proximity
assay (SPA) or the flashplate assay. Other assay formats well known to those skilled in the
art, such as the filter binding assay and the centrifugation assay, are also contemplated in
the present invention.

SPA and flashplate assays are preferred assay formats for the present invention. Additional 
details on these assays are provided below. 

Scintillation assay format

Scintillation assay technology involves the use of either scintillant beads (for the SPA
assay) or plates (for the flashplate assay). SPA beads are usually made from either
cerium-doped yttrium silicate (Y2SiO5:Ce) or polyvinyltoluene (PVT) containing an organic
scintillant such as PPO. Flashplates commonly used are those such as Ni chelate
flashplates, although other flashplates can also be used.

Assays are usually carried out in aqueous buffers using radioisotopes such as 3H, 125I, 14C,
35S or 33P that emit low-energy radiation, the energy of which is easily dissipated in an
aqueous environment. For example, the electrons emitted by 3H have an average energy
of only 6 keV and have a very short path length (~1 µm) in water. If a molecule labelled
with one of these isotopes is bound to the bead or flashplate surface, either directly or via
interaction with another molecule previously coupled to the bead or flashplate, the emitted
radiation will activate the scintillant and produce light. The amount of light produced,
which is proportional to the amount of labelled molecules bound to the beads, can be
measured conveniently with a liquid scintillation (LS) counter. If the labelled molecule is
not attached to the bead or flashplate surface, its radiation energy is absorbed by the
surrounding aqueous solvent before it reaches the bead, and no light is produced. Thus,
bound ligands give a scintillation signal, but free ligands do not, and the need for a
time-consuming separation step, characteristic of conventional radioligand binding assays, is
eliminated. The manipulations required in the assays are reduced to a few simple pipetting
steps, leading to better precision and reproducibility.

The conditions under which SPA and flashplate assays are performed in the context of the 
present invention are provided below. 


Scintillation assay conditions 
a) SPA assay 

The SPA assay is first developed to optimize the conditions under which the radioligand
binds the secreted soluble α2δ-2, α2δ-3 or α2δ-4 subunit polypeptide. The parameters
which can be varied to optimize radioligand binding in a typical SPA assay using
Amersham beads include assay temperature, α2δ-2, α2δ-3 or α2δ-4 subunit polypeptide
interaction with the radioligand and the SPA beads, radioligand concentration, as well as
pH variations.

The temperature at which the assay can be carried out can vary from 1 to 30°C. Preferred
temperatures range from 18 to 23°C, with 21°C being the most preferred temperature. The
interaction of the α2δ subunit polypeptide with the SPA beads can be optimized by
adjusting the concentration of the polypeptide and by introducing a reagent which will
favor this interaction. When 50 mg of Amersham SPA beads are used, the α2δ-1 subunit
polypeptide concentration may vary from 0.1 to 10 pmoles per well, with the optimal
concentration being generally around 5 to 6 pmoles per well.

As for the reagent favoring the interaction between the secreted soluble α2δ-2, α2δ-3 or
α2δ-4 subunit polypeptide and the radioligand as well as the Amersham SPA beads, the
inventors found that imidazole could be efficiently used for that purpose when the α2δ-2,
α2δ-3 or α2δ-4 subunit polypeptide is tagged with an amino acid sequence including 6
histidine residues. Furthermore, and more importantly, it was found that imidazole also
enhanced binding of the radioligand to the α2δ-2, α2δ-3 or α2δ-4 polypeptide.

The concentration of the radioligand is evaluated with respect to the concentration of
secreted soluble α2δ-2, α2δ-3 or α2δ-4 subunit polypeptide present in the assay medium.
Generally, the concentration of radioligand varies from 1 nM to 100 nM. A preferred
[3H]gabapentin concentration is about 5 to 20 nM, with a most preferred concentration
being about 10 nM. A preferred [3H]leucine concentration is also about 5 to 20 nM, with a
most preferred concentration being about 10 nM. It is to be noted that the concentration of
other radioligands having affinities similar to those of [3H]gabapentin and [3H]leucine
should also be in the range of about 5 to 20 nM.

Once the optimal radioligand binding conditions have been determined, a test ligand can be
introduced in the assay medium to evaluate the level of displacement of the radioligand.
The concentration of test ligand to be introduced in the assay medium usually varies from
0.1 nM to about 100 µM. A test ligand concentration of about 10 µM is usually a preferred
starting point in a high throughput screening assay. Then, depending on the number of hits
obtained, it may be lowered or increased.

It is to be noted that the parameters set forth above, which have been evaluated for a 
typical SPA assay using Amersham SPA beads can be adjusted by the skilled person, for 
example if SPA beads of a different type are used. 

b) Flashplate assay 

As with the SPA assay, the flashplate assay can first be developed in order to optimize the
conditions under which the radioligand binds the α2δ subunit polypeptide. The parameters
which can be varied to optimize radioligand binding in a typical flashplate assay using
NEN Ni chelate flashplates also include assay temperature, secreted soluble α2δ-2, α2δ-3
or α2δ-4 subunit polypeptide interaction with both the radioligand and the flashplates,
radioligand concentration, as well as pH variations.

The temperature at which the assay can be carried out can vary from 1 to 30°C. Preferred 
temperatures range from 18 to 23 °C, with 21°C being the most preferred temperature. 

The interaction of the secreted soluble α2δ-2, α2δ-3 or α2δ-4 subunit polypeptide with the
flashplates can be optimized by adjusting the concentration of the polypeptide and by
introducing a reagent which will favor this interaction. When a standard NEN Ni chelate
flashplate is used, the volume of secreted soluble α2δ-2, α2δ-3 or α2δ-4 subunit polypeptide
usually varies between 0.5 and 20 µl for a concentration of secreted soluble α2δ-2, α2δ-3 or
α2δ-4 subunit polypeptide of 0.6 pmol/µl. As the published maximum binding capacity of
NEN plates is about 6 pmol per well, the inventors consider that an optimal amount
of secreted soluble α2δ-2, α2δ-3 or α2δ-4 subunit polypeptide is probably around 5 pmol
per well at 8 µl.
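
The arithmetic behind the figures above can be checked with a minimal sketch. The function names are illustrative assumptions; the numbers (0.6 pmol/µl stock, 8 µl per well, about 6 pmol per well capacity) are taken from the paragraph above.

```python
def pmol_per_well(stock_pmol_per_ul, volume_ul):
    """Amount of polypeptide (pmol) delivered to one well."""
    return stock_pmol_per_ul * volume_ul


def saturating_volume_ul(capacity_pmol, stock_pmol_per_ul):
    """Volume of stock (µl) that would reach the plate's stated binding capacity."""
    return capacity_pmol / stock_pmol_per_ul


# 8 µl of a 0.6 pmol/µl stock gives about 4.8 pmol, i.e. roughly 5 pmol per well.
loaded = pmol_per_well(0.6, 8)

# About 10 µl of the same stock would reach the ~6 pmol per well capacity.
limit_ul = saturating_volume_ul(6.0, 0.6)

print(f"loaded: {loaded:.1f} pmol/well; capacity reached at {limit_ul:.1f} µl")
```

On these figures, 8 µl stays just below the stated capacity of the plate, which is consistent with the "around 5 pmol per well" estimate.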



With regard to the reagent favoring the interaction between the secreted soluble α2δ-2,
α2δ-3 or α2δ-4 subunit polypeptide and the radioligand as well as the flashplates, the
inventors believe that imidazole could also be efficiently used for that purpose when the
secreted soluble α2δ-2, α2δ-3 or α2δ-4 subunit polypeptide is tagged with an amino acid
sequence including 6 histidine residues. The inventors also believe that imidazole
can substantially enhance binding of the radioligand to the secreted
soluble α2δ-2, α2δ-3 or α2δ-4 polypeptide. The optimal concentration of imidazole used to
enhance radioligand binding varies depending on the concentration of secreted soluble
α2δ-2, α2δ-3 or α2δ-4 subunit polypeptide used in the assay. For instance, when the
volume of the α2δ-1 subunit polypeptide is about 10 µl (α2δ-2, α2δ-3 or α2δ-4
polypeptide concentration of 0.6 pmol/µl), the optimal imidazole concentration can vary
between 1 and 20 mM, with a concentration of about 10 mM being preferred. As
mentioned previously, other compounds such as histidine as well as pH variations may be
used to enhance radioligand binding.

The concentration of the radioligand is evaluated with respect to the concentration of
α2δ-2, α2δ-3 or α2δ-4 subunit polypeptide present in the assay medium. Generally, the
concentration of radioligand varies from 1 nM to 100 nM. A preferred [3H]gabapentin
concentration is about 5 to 20 nM, with a most preferred concentration being about 10 nM.
A preferred [3H]leucine concentration is also about 5 to 20 nM, with a most preferred
concentration being about 10 nM. It is to be noted that the concentration of other
radioligands having affinities similar to those of [3H]gabapentin and [3H]leucine should
also be in the range of about 5 to 20 nM.

Once the optimal radioligand binding conditions have been determined, a test ligand can be
introduced in the assay medium to evaluate the level of displacement of the radioligand.
The concentration of test ligand to be introduced in the assay medium usually varies from
0.1 nM to about 100 µM. A test ligand concentration of about 10 µM is usually a preferred
starting point in a high throughput screening assay. Then, depending on the number of hits
obtained, it may be lowered or increased.

The inventors have tested the displacement of a particular radioligand, [3H]gabapentin,
with (S+)-3-isobutyl GABA. The data provided in the examples which follow clearly show
that the assay can be used in high throughput competition studies.
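
How displacement of the radioligand by a test ligand might be quantified can be sketched as follows. This is a generic one-site competition model and a generic percent-displacement formula, offered as an assumption for illustration; the specification does not prescribe a particular data-reduction method, and all function names and count values are hypothetical.

```python
def fraction_bound(test_ligand_nm, ic50_nm):
    """Fraction of specific radioligand binding remaining at a given
    test-ligand concentration (simple one-site competition, Hill slope 1)."""
    return 1.0 / (1.0 + test_ligand_nm / ic50_nm)


def percent_displacement(total_cpm, nonspecific_cpm, observed_cpm):
    """Percentage of specific binding displaced by the test ligand.

    total_cpm:       counts with radioligand alone
    nonspecific_cpm: counts in the presence of a saturating competitor
    observed_cpm:    counts in the presence of the test ligand
    """
    specific = total_cpm - nonspecific_cpm
    if specific <= 0:
        raise ValueError("total binding must exceed non-specific binding")
    return 100.0 * (total_cpm - observed_cpm) / specific


# At a test-ligand concentration equal to its IC50, half of the
# specific binding remains.
half = fraction_bound(100.0, 100.0)

# Hypothetical counts: 10000 cpm total, 1000 cpm non-specific, 5500 cpm observed.
displaced = percent_displacement(10000.0, 1000.0, 5500.0)
print(half, displaced)
```

In a screen run at a single concentration (for example the 10 µM starting point mentioned above), a threshold on the percent displacement could then be used to call hits; the threshold itself would be a design choice of the screener.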

E) Purified recombinant secreted soluble α2δ-2, α2δ-3 or α2δ-4 polypeptides

Another object of the present invention consists of a purified or isolated recombinant
polypeptide comprising the amino acid sequence of a secreted soluble α2δ-2, α2δ-3 or
α2δ-4 subunit polypeptide.

Preferred isolated recombinant polypeptides of the invention include those having at least
80%, preferably 90%, more preferably 95%, and most preferably 98% or 99%, amino acid
identity with polypeptides comprising from amino acid 1 to between amino acids 1027 and
1145, preferably between amino acids 1062 and 1145 of SEQ ID N°20, as well as those
having at least 80%, preferably 90%, more preferably 95%, and most preferably 98% or 99%,
amino acid identity with polypeptides comprising from amino acid 1 to between amino
acids 984 and 1085, preferably between amino acids 1019 and 1085 of SEQ ID N°22.

In a further preferred embodiment, the polypeptide comprises an amino acid sequence
having at least 80%, preferably 90%, more preferably 95%, and most preferably 98% or
99% amino acid identity with the amino acid sequence of SEQ ID N°4, SEQ ID N°5, SEQ
ID N°6, SEQ ID N°10, SEQ ID N°11, SEQ ID N°12, SEQ ID N°16, SEQ ID N°17 and
SEQ ID N°19.
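
The identity thresholds above can be made concrete with a minimal sketch that computes percent identity over two sequences that have already been aligned. The convention used here (identical positions divided by the number of aligned columns that are not gaps in both sequences) is an assumption; other conventions exist, and the short sequences are invented for illustration only.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity between two pre-aligned sequences of equal length.

    Columns where both sequences carry a gap are ignored; a column with a
    gap in only one sequence counts as a mismatch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap and b == gap:
            continue
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return 100.0 * matches / compared


def meets_threshold(aligned_a, aligned_b, threshold=80.0):
    """True when identity is at least the given percentage (80% by default)."""
    return percent_identity(aligned_a, aligned_b) >= threshold


# Invented five-residue example: four identical positions out of five.
example = percent_identity("MAVPA", "MAVPG")
print(example)
```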

F) Modified secreted soluble α2δ-2, α2δ-3 or α2δ-4 subunit polypeptides

The invention also relates to a secreted soluble α2δ-2, α2δ-3 or α2δ-4 subunit polypeptide
comprising amino acid changes ranging from 1, 2, 3, 4, 5, 10, 20, 25, 30, 35 to 40
substitutions, additions or deletions of one amino acid as compared with the polypeptides of
any one of the amino acid sequences of the present invention. Preferred sequences are those
of SEQ ID N°4, SEQ ID N°5, SEQ ID N°6, SEQ ID N°10, SEQ ID N°11, SEQ ID N°12, SEQ ID
N°16, SEQ ID N°17 and SEQ ID N°18.

In the case of an amino acid substitution in the amino acid sequence of a polypeptide
according to the invention, one or several consecutive or non-consecutive amino acids are
replaced by "equivalent" amino acids. The expression "equivalent" amino acid is used
herein to designate any amino acid that may be substituted for one of the amino acids
belonging to the native protein structure without decreasing the binding properties of the
corresponding peptides to the antibodies raised against the polypeptides of the invention.
In other words, the "equivalent" amino acids are those which allow the generation or the
synthesis of a polypeptide with a modified sequence when compared to the amino acid
sequence of the secreted soluble α2δ-2, α2δ-3 or α2δ-4 subunit polypeptides of interest,
said modified polypeptide being able to bind to the antibodies raised against the secreted
soluble α2δ-2, α2δ-3 or α2δ-4 subunit polypeptide of interest and/or to induce antibodies
recognizing the parent polypeptide.

Alternatively, amino acid changes encompassed are those which will not abolish the
biological activity of the resulting modified polypeptide. These equivalent amino acids
may be determined either by their structural homology with the initial amino acids to be
replaced, by the similarity of their net charge or of their hydrophobicity, and optionally by
the results of the cross-immunogenicity between the parent peptides and their modified
counterparts.

The peptides containing one or several "equivalent" amino-acids must retain their 
specificity and affinity properties to the biological targets of the parent protein, as it can be 
assessed by a ligand binding assay or an ELISA assay. 

Examples of amino acids belonging to specific classes include Acidic (Asp, Glu), Basic
(Lys, Arg, His), Non-polar (Ala, Val, Leu, Ile, Pro, Met, Phe, Trp) or uncharged Polar
(Gly, Ser, Thr, Cys, Tyr, Asn, Gln) amino acids.

Preferably, a substitution of an amino acid in the α2δ-2, α2δ-3 or α2δ-4 subunit polypeptide
of the invention, or in a peptide fragment thereof, consists in the replacement of an amino
acid of a particular class by another amino acid belonging to the same class.
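
The same-class substitution rule can be expressed as a small lookup. The class membership is taken from the list given above; the dictionary layout and function names are illustrative assumptions.

```python
# Amino acid classes as listed in the text (three-letter codes).
AMINO_ACID_CLASSES = {
    "acidic": {"Asp", "Glu"},
    "basic": {"Lys", "Arg", "His"},
    "non-polar": {"Ala", "Val", "Leu", "Ile", "Pro", "Met", "Phe", "Trp"},
    "uncharged polar": {"Gly", "Ser", "Thr", "Cys", "Tyr", "Asn", "Gln"},
}


def residue_class(residue):
    """Return the name of the class a residue belongs to."""
    for class_name, members in AMINO_ACID_CLASSES.items():
        if residue in members:
            return class_name
    raise ValueError(f"unknown residue: {residue}")


def is_same_class_substitution(original, replacement):
    """True when the replacement belongs to the same class as the original."""
    return residue_class(original) == residue_class(replacement)


print(is_same_class_substitution("Leu", "Ile"))  # both non-polar
print(is_same_class_substitution("Asp", "Lys"))  # acidic versus basic
```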

By an equivalent amino acid according to the present invention is also contemplated the 
replacement of a residue in the L-form by a residue in the D form or the replacement of a 
Glutamic acid (E) residue by a Pyro-glutamic acid compound. The synthesis of peptides 
containing at least one residue in the D-form is, for example, described by Koch (1977). 

A specific embodiment of a modified peptide of interest according to the present invention
includes, but is not limited to, a peptide molecule which is resistant to proteolysis. This is
a peptide in which the -CONH- peptide bond is modified and replaced by a (CH2NH)
reduced bond, a (NHCO) retro-inverso bond, a (CH2-O) methylene-oxy bond, a (CH2S)
thiomethylene bond, a (CH2CH2) carba bond, a (CO-CH2) ketomethylene bond, a
(CHOH-CH2) hydroxyethylene bond, a (N-N) bond, an E-alkene bond or also a -CH=CH- bond.
The invention also encompasses secreted soluble α2δ-2, α2δ-3 or α2δ-4 subunit
polypeptides in which at least one peptide bond has been modified as described above.
The polypeptides according to the invention may also be prepared by the conventional
methods of chemical synthesis, either in a homogenous solution or in solid phase. As an
illustrative embodiment of such chemical polypeptide synthesis techniques, one may cite
the homogenous solution technique described by Houben-Weyl (1974).
The secreted soluble α2δ-2, α2δ-3 or α2δ-4 subunit polypeptide of interest, or a fragment
thereof, may thus be prepared by chemical synthesis in liquid or solid phase by successive
couplings of the different amino acid residues to be incorporated (from the N-terminal end
to the C-terminal end in liquid phase, or from the C-terminal end to the N-terminal end in
solid phase) wherein the N-terminal ends and the reactive side chains are previously
blocked by conventional groups.

For solid phase synthesis, the technique described by Merrifield (1965a; 1965b) may be 
used in particular. 

G) Antibody production 

The secreted soluble α2δ-2, α2δ-3 or α2δ-4 subunit polypeptides of the invention and their
peptide fragments of interest can be used for the preparation of antibodies.
Polyclonal antibodies may be prepared by immunization of a mammal, especially a mouse
or a rabbit, with a polypeptide according to the invention that is combined with an adjuvant
of immunity, and then by purifying the specific antibodies contained in the serum of the
immunized animal on an affinity chromatography column on which has previously been
immobilized the polypeptide that has been used as the antigen.

Monoclonal antibodies may be prepared from hybridomas according to the technique
described by Kohler and Milstein (1975).
The present invention also deals with antibodies produced by the trioma technique and by
the human B-cell hybridoma technique, such as described by Kozbor et al. (1983).
Antibodies of the invention also include chimeric single chain Fv antibody fragments (US
Patent N° 4,946,778; Martineau et al. (1998)), antibody fragments obtained through phage
display libraries (Ridder et al. (1995)) and humanized antibodies (Leger et al. (1997)).

H) Screening assays 

The invention concerns a method for the screening of ligands which bind soluble secreted
α2δ-2, α2δ-3 or α2δ-4 subunit polypeptide. More particularly, the targeted α2δ-2, α2δ-3 or
α2δ-4 subunit binding site is preferably the [3H]gabapentin binding site. The various
parameters of the method of the invention are described in further detail below.
EXAMPLES

Example 1

Construction of a nucleotide sequence encoding a soluble secreted human α2δ-2
subunit polypeptide deletion mutant of SEQ ID N°23

a) Primer design 

PCR primers were designed to generate the secreted soluble human α2δ-2 deletion mutant
of SEQ ID N°23 as follows:

5' PCR primer: This was designed to engineer in a KOZAK translation initiation
consensus sequence prior to the coding sequence (Kozak JBC 266 19867-19870).
3' PCR primer: This was designed to engineer in six histidine residues followed by a stop
codon at the desired location in the coding sequence. In addition to the stop codon, the
α2δ-2 primers also included an Eco RI restriction site.



The bold region in each primer sequence denotes the 'tagged' region, i.e. the addition of
sequences not present in the template. Primers were custom synthesized by Perkin Elmer
Applied Biosystems UK to the ABI ready pure grade, supplied lyophilized, then
resuspended to 15µM in 10mM TE. JB197 and JB198 were provided with 5' phosphate
groups:

5' Primer JB197 (5'-TCGCCACCATGGCGGTGCCGGCTC-3', SEQ ID N°25)

3' Primer JB198 (5'-cggaattcctcagtgatggtgatggtgatgggccccgcggccacagtc-3', SEQ ID
N°28)
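
The design features described for the two primers can be checked directly on the sequences as transcribed above. The sketch below is illustrative only: the helper name is an assumption, and the Kozak motif `GCCACCATGG` is the commonly cited consensus form rather than a string quoted from the specification.

```python
# Primer sequences as transcribed in the text above.
JB197 = "TCGCCACCATGGCGGTGCCGGCTC"
JB198 = "cggaattcctcagtgatggtgatggtgatgggccccgcggccacagtc"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence (upper-cased)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


# The 3' primer anneals to the sense strand, so its reverse complement
# shows what is appended to the coding sequence.
sense_3prime = reverse_complement(JB198)

ECORI_SITE = "GAATTC"
KOZAK_CORE = "GCCACCATGG"            # assumed consensus form, start codon included
HIS6_PLUS_STOP = "CATCACCATCACCATCAC" + "TGA"  # six His codons, then a stop codon

has_kozak = KOZAK_CORE in JB197
has_ecori = ECORI_SITE in JB198.upper()
has_his6_stop = HIS6_PLUS_STOP in sense_3prime

print(sense_3prime)
print(has_kozak, has_ecori, has_his6_stop)
```

Read on the sense strand, the 3' primer therefore adds six histidine codons, a TGA stop codon and an Eco RI site, in that order, after the last templated codon.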
b) Protocol for PCR mediated 5' Kozak and 3' 6His tagging of human α2δ-2

The full length human α2δ-2 gene (GenBank Accession Number AF042792) in a pcDNA
3 vector (as described in Brown, J.P. and Gee, N.S., Cloning and deletion mutagenesis of the
α2δ calcium channel subunit from porcine cerebral cortex, The Journal of Biological
Chemistry, 273(39):25458-25465) was used as the template in the following PCR reaction.
The reagents were added in the following order in triplicate to a 96 well PCR plate:

Reagent                                Volume (µl)

10x Pfx Amplification buffer           5
10mM dNTPs                             1.5
50mM MgSO4                             1
15µM JB197                             1.5
15µM JB198                             1.5
100ng/µl pcDNA3.1-human-α2δ-2          1
10x PCR Enhancer                       5
H2O                                    32.7
2.5 units/µl Pfx polymerase            0.8
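
As a consistency check on the reaction mix listed above, the component volumes can be summed and the final concentrations derived by simple dilution. The volumes and stock concentrations are those listed; the variable and function names are illustrative assumptions.

```python
# (component, volume in µl) as listed for one PCR reaction.
PCR_MIX_UL = [
    ("10x Pfx Amplification buffer", 5.0),
    ("10mM dNTPs", 1.5),
    ("50mM MgSO4", 1.0),
    ("15uM JB197", 1.5),
    ("15uM JB198", 1.5),
    ("100ng/ul template", 1.0),
    ("10x PCR Enhancer", 5.0),
    ("H2O", 32.7),
    ("2.5 units/ul Pfx polymerase", 0.8),
]

total_ul = sum(volume for _, volume in PCR_MIX_UL)


def final_concentration(stock, volume_ul, total_volume_ul):
    """Concentration after dilution, in the same unit as the stock."""
    return stock * volume_ul / total_volume_ul


primer_um = final_concentration(15.0, 1.5, total_ul)   # each primer, µM
dntp_mm = final_concentration(10.0, 1.5, total_ul)     # dNTPs, mM
mgso4_mm = final_concentration(50.0, 1.0, total_ul)    # MgSO4, mM
template_ng = 100.0 * 1.0                              # ng of plasmid per reaction
enzyme_units = 2.5 * 0.8                               # units of polymerase

print(f"total {total_ul:.1f} µl, primers {primer_um:.2f} µM, "
      f"dNTPs {dntp_mm:.2f} mM, MgSO4 {mgso4_mm:.1f} mM")
```

The volumes add up to a 50 µl reaction, so the two 10x reagents (5 µl each) are at 1x in the final mix.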



The plate was then cycled on an MJ Tetrad DNA engine according to the following cycling
conditions:

94°C / 2 mins
followed by 30 cycles of:
94°C / 45 sec
58°C / 45 sec
68°C / 4 mins
followed by:
68°C / 10 mins
followed by:
hold at 4°C

The 3366bp product was then gel purified from a 1% TAE agarose gel using QIAEX beads
and eluted in approximately 50µl TE.

Example 2

Cloning of the PCR fragments of Example 1 into the Baculovirus transfer vector
pFastBac1

The PCR products of Example 1 were cloned into Stu I digested, calf intestinal
phosphatase dephosphorylated, phenol chloroform extracted and QIAEX gel purified
pFastBac1 (Life Technologies) using the Rapid DNA ligation kit (Roche Diagnostics),
transforming XL1-blue (α2δ-1b) E. coli cells.
a) Screening for positive recombinants

Given that the PCR product was cloned by blunt-end ligation, a screen was required to
select a recombinant with the gene ligated in the positive orientation with respect to the
polyhedrin promoter in pFastBac1. This was achieved by restriction digest of miniprep
DNA (Qiagen miniprep kit) prepared from colony minicultures and analysis on a 1% TAE
agarose gel. A positive clone was identified according to the following digest patterns:



SEQ ID N°23 in pFastBac1
Eco RI digest performed on miniprep DNA

Orientation                                     Predicted fragments (bp)

PCR product cloned in a positive orientation    4773 and 3368
PCR product cloned in a negative orientation    8127 and 14
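
The predicted digest patterns can be cross-checked and used to call the orientation of a clone, as in the sketch below. The fragment sizes are those listed above and the 3366 bp insert size is the PCR product size given in Example 1; the function name, the 50 bp tolerance, and the decision to match on the largest band only (on the assumption that a 14 bp fragment would not be resolved on a 1% agarose gel) are illustrative choices.

```python
# Predicted restriction fragments (bp) for each insert orientation.
PREDICTED_FRAGMENTS = {
    "positive": (4773, 3368),
    "negative": (8127, 14),
}

PCR_PRODUCT_BP = 3366  # size of the gel-purified PCR product

# Both patterns must add up to the same circular construct.
construct_sizes = {name: sum(frags) for name, frags in PREDICTED_FRAGMENTS.items()}

# Vector backbone length implied by the construct size and the insert size.
implied_vector_bp = construct_sizes["positive"] - PCR_PRODUCT_BP


def call_orientation(observed_bands_bp, tolerance_bp=50):
    """Return 'positive' or 'negative' based on the largest observed band,
    or None when it matches neither prediction within the tolerance."""
    largest = max(observed_bands_bp)
    for name, fragments in PREDICTED_FRAGMENTS.items():
        if abs(largest - max(fragments)) <= tolerance_bp:
            return name
    return None


print(construct_sizes, implied_vector_bp)
print(call_orientation([4800, 3350]))
```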



b) Sequencing analysis of selected clones

One positive clone was selected and used to prepare a plasmid DNA stock of the
desired construct (QIAGEN maxi kit). Confirmatory sequence reactions were performed
using the Big Dye terminator sequencing kit and run on an ABI 310 Prism Genetic
Analyzer. Sequence analysis of both coding strands was performed using a selection of
sequencing oligonucleotide primers.




Example 3

Protocol for establishing baculovirus banks for the expression of the α2δ-2 deletion
mutant of SEQ ID N°23

Essentially, the protocol used to generate the baculovirus banks is that outlined in the Life
Technologies Bac-to-Bac™ baculovirus expression systems manual.



a) Transposition of DH10Bac E. coli cells
One ng (5µl) of the recombinant pFastBac-1 construct containing the nucleotide sequence
encoding the porcine α2δ-2 deletion mutant of SEQ ID N°23 was added to 100µl of
DH10Bac cells thawed on ice. The cells were then mixed gently by tapping the tube, then
incubated on ice for 30 minutes before heat shock treatment by incubation in a 42°C water
bath for 45 seconds. The mixture was then chilled on ice for 2 minutes before the addition
of 900µl of S.O.C. medium. The mixture was then placed in a shaking incubator (200rpm)
at 37°C for 4 hours. The cells were then serially diluted (10 fold dilutions from 10^-1 to 10^-3)
and 10µl of each dilution plated on LB agar plates containing 50µg/ml kanamycin, 7µg/ml
gentamicin, 10µg/ml tetracycline, 100µg/ml Bluo-gal and 40µg/ml IPTG. The plates were
incubated at 37°C for between 1 and 3 days until discrete colonies of blue and white colour
were discernible.

b) Isolation of recombinant DNA 

White colonies (containing the recombinant bacmid) were picked and grown for 24 hours
(to stationary phase) at 37°C with shaking (200rpm) in 2ml of LB containing 50µg/ml
kanamycin, 7µg/ml gentamicin and 10µg/ml tetracycline. 1.5ml of culture was then
transferred to a microfuge tube and centrifuged at 14,000xg for 1 minute. The supernatant
was removed and the cells resuspended gently in 0.3ml of 15mM Tris-HCl (pH8.0), 10mM
EDTA, 100µg/ml RNase A. 0.3ml of 0.2N NaOH, 1% SDS was then added and the
mixture mixed gently before incubation at 22°C for 5 minutes. Then 0.3ml of 3M
potassium acetate (pH5.5) was added and the sample placed on ice for 10 minutes. After
centrifugation at 14,000xg for 10 minutes the supernatant was transferred to a tube
containing 0.8ml of isopropanol, mixed, then placed on ice for 10 minutes before
centrifugation at 14,000xg for 10 minutes. The supernatant was then discarded and the
pellet rinsed with 0.5ml of 70% ethanol before centrifugation at 14,000xg for 5 minutes.
This 70% ethanol rinse was then repeated before removing all of the supernatant and air
drying the pellet for 10 minutes at room temperature. The pellet was finally resuspended in
40µl of TE.

c) Transfection of sf9 cells with the recombinant bacmid DNA 

A 6-well tissue culture plate was seeded with 0.9×10^6 sf9 cells (cells at log phase having
grown from a culture passaged at 0.3×10^6 cells/ml) per 35mm well in 2ml of Sf-900 II
SFM media containing 50units/ml penicillin and 50µg/ml streptomycin. Cells were left to
attach at 27°C for 1 hour. Bacmid DNA prepared as described above (5µl) was added to
200µl of Sf-900 II SFM media containing 6µl of CELLFECTIN and mixed before
incubation at room temperature for 45 minutes. The cells were washed once with 2ml of
Sf-900 II SFM media without antibiotics, then 0.8ml of Sf-900 II SFM media was added to
each tube containing the lipid-DNA complex. The wash buffer was removed from the cells
and the 1ml of diluted lipid-DNA complex overlaid on the cells. The cells were incubated
for 5 hours at 27°C, after which time the transfection mixture was removed and 2ml of
Sf-900 II SFM media containing 50units/ml penicillin and 50µg/ml streptomycin was added.
The cells were then incubated for 72 hours.



After incubation for 72 hours the media was removed from the cells and centrifuged at
500xg for 5 minutes. The supernatant was then transferred to a fresh tube, labelled
as the P0 bank and stored at 4°C in the dark. The P1 bank was prepared by passaging sf9
cells at approx. 5×10^6 cells/ml to 2×10^6 cells/ml (100ml in a 250ml Erlenmeyer flask) and
adding 0.5ml of the P0 bank harvested above. The cells were then incubated with shaking
(200rpm) at 27°C for 4 days. Under sterile conditions the culture was centrifuged at 500xg
for 10 minutes and the supernatant 0.2µm filtered (P1 bank). The P2 bank was prepared by
adding 2ml of P1 bank per 400ml culture (in 1L Erlenmeyer flasks) passaged as above to
2×10^6 cells/ml. The culture was incubated as before for 4 days and the supernatant
harvested and filtered as described for the P1 bank. The supernatant was first pooled, then
aliquoted (10ml) and stored at 4°C.
Example 4

Expression of the α2δ-2 deletion mutant of SEQ ID N°23

To sf9 cells passaged from ~5×10^6 cells/ml to 2×10^6 cells/ml in Sf-900 II SFM media was
added 0.1ml virus per 100ml of cells of the appropriate viral bank (400ml volumes in 1L
Erlenmeyer flasks). The cells were then cultured for 4-5 days at 27°C with 110rpm
shaking. Expression of the protein was confirmed by SDS-PAGE and Western blotting
using an anti penta-His monoclonal antibody (Qiagen) and was detected in the culture
supernatant and cell lysate.

Example 5

Purification of the α2δ-2 deletion mutant of SEQ ID N°23

The α2δ-2 deletion mutant of SEQ ID N°23 was purified from the cell lysate following the
purification strategy outlined below:




The culture was centrifuged at 6,000xg for 10 minutes and the supernatant removed. The
weight of the cell pellet was determined before re-suspension in 20mM Tris pH8.0,
100mM KCl, 1% Nonidet P40 (100ml per 20g of wet cells). A protease inhibitor cocktail
(Sigma, Cat# P8849), 1ml/L, was added to the mixture. The solution was then stirred for
10 minutes before centrifugation for 1 hour at 30,000xg and 4°C. The supernatant was
concentrated (30kDa cut-off) to approx. 300ml, then centrifuged for 1 hour at 100,000xg.

Supernatant containing the soluble proteins was diluted 1:3 in 10mM Tris-HCl pH8.0
(equilibration buffer) and loaded onto a pre-equilibrated Q-Sepharose column (2.5cm i.d. x
30cm h.) at a flow rate of 900ml/h. After washing with equilibration buffer until a stable
A280nm baseline had been achieved, protein was eluted with 20mM Tris-HCl pH8.0, 0.5M
KCl, 10mM Imidazole.

The eluate was then loaded onto a Ni-NTA (Qiagen) column (2.5cm i.d. x 6cm h.)
pre-equilibrated in 20mM Tris pH8.0, 0.5M KCl, 10mM Imidazole at a flow rate of 2 ml/min.

The column was washed successively with buffer A (20mM Tris pH8.0, 0.5M KCl, 20mM
Imidazole), buffer B (100mM Tris-HCl pH8.0, 1M KCl), and buffer A again. Elution was
performed with buffer C (20mM Tris-HCl pH8.0, 100mM KCl, 0.5M Imidazole). The
Ni-NTA eluate (~50ml) was concentrated (30kDa cut-off) to ~2ml and applied at 1 ml/min,
in 0.2ml aliquots, to an FPLC Superdex-200 column equilibrated in 10mM HEPES, pH7.4,
150mM NaCl. Fractions containing the polypeptide of SEQ ID N°23 were pooled.



Example 6

SPA assay of [3H]gabapentin binding to the secreted soluble human α2δ-2 subunit of
SEQ ID N°23

The assay is carried out at 21°C. Assay components are added to 96-well Optiplates in the
following order (all reagents are diluted in 10mM HEPES, pH 7.4 at 21°C):

25µl imidazole at various concentrations (diluted from a 1M stock
pH8.0, see assay details)

50µl 10mM HEPES pH 7.4

25µl (50mg) SPA beads (Amersham)

100µl s-α2δ-2 subunit polypeptide of SEQ ID N°23 (2µl protein
diluted to 100µl) obtained from example 5

25µl radioligand ([3H]gabapentin)

Immediately after adding radioligand, the Optiplates were loaded in the Packard Top Count
scintillation counter to follow the binding time course. Imidazole was first used in the
assay to optimize the specific interaction of the protein's 6His tag with the SPA bead.
Imidazole itself (up to 100mM) in the filtration assay has no effect on [3H]gabapentin
binding (n=1).

Example 7

Ni Flashplate assay of [3H]gabapentin binding to secreted soluble human α2δ-2 (SEQ
ID N°23)

Assays are carried out at 21°C in a final volume of 250µl in 96-well NEN Ni chelate flash
plates. Assay components are added in the following order (all reagents were diluted in
10mM HEPES (pH 7.4 at 21°C)):

25µl 10mM HEPES pH7.4

25µl imidazole at various concentrations (diluted from a 1M stock
pH8.0, see assay details)

75µl 10mM HEPES pH 7.4

100µl s-α2δ-2-6His (2µl protein diluted to 100µl) obtained from
example 5

25µl radioligand ([3H]gabapentin (65Ci/mmole))
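
As a consistency check, the component volumes listed above should add up to the stated final volume of 250µl, and the final concentration of any component follows from simple dilution. The sketch below is illustrative only; the names are hypothetical and the 100mM imidazole working stock in the usage example is an assumed value, not one taken from the text.

```python
# Component volumes (µl) in the order listed for the Ni Flashplate assay.
COMPONENT_VOLUMES_UL = [
    ("10mM HEPES pH7.4", 25.0),
    ("imidazole", 25.0),
    ("10mM HEPES pH 7.4", 75.0),
    ("s-alpha2delta-2-6His", 100.0),
    ("radioligand", 25.0),
]
FINAL_VOLUME_UL = 250.0


def total_volume_ul(components):
    """Sum the volumes of all listed components."""
    return sum(volume for _, volume in components)


def final_concentration(stock_conc, added_ul, final_ul):
    """Concentration after dilution, in the same unit as stock_conc."""
    return stock_conc * added_ul / final_ul


if __name__ == "__main__":
    print(total_volume_ul(COMPONENT_VOLUMES_UL))  # compare with 250 µl
    # Hypothetical 100 mM imidazole working stock (assumption).
    print(final_concentration(100.0, 25.0, FINAL_VOLUME_UL), "mM final")
```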

Immediately after adding the radioligand, flash plates are loaded in the Packard Top Count
scintillation counter to follow the binding time course. The '[3H] flash plate' programme
(cpm) is used to monitor activity. Imidazole is first used in the assay to optimize the
specific interaction of the protein's 6His tag with the Ni flashplate.



Example 8 

Ni Flashplate assay of [3H]Leucine binding to secreted soluble human α2δ-2-6His

The procedure described in example 7 is repeated, except that [3H]gabapentin is replaced
by 25µl (10.1 nM) of [3H]Leucine (141 Ci/mmole).

Example 9

Ni Flashplate assay studying competitive binding of [3H]gabapentin and (S+)-3-isobutyl
GABA to human α2δ-2-6His (SEQ ID N°23).

Assays are carried out at 21°C in a final volume of 250µl in 96-well NEN Ni chelate flash
plates. Wells are set up for both 'total' and 'non-specific' binding. Specific binding is
defined as that remaining after subtraction of the average of the 'non-specific binding'
values from the average of the 'total' binding values. Assay components are added in the
following order (all reagents were diluted in 10mM HEPES (pH 7.4 at 21°C)):

25µl 10mM HEPES pH7.4 or 25µl of the test compound at the
appropriate concentration in HEPES

25µl 200 mM imidazole (diluted from a 1M stock pH8.0, see
assay details)

Total binding: 75µl 10mM HEPES pH 7.4

Non-specific binding: 50µl 10mM HEPES pH 7.4 and 25µl 100µM (S+)-3-isobutyl
GABA

100µl α2δ-2-6His (2µl protein* diluted to 100µl)

25µl radioligand ([3H]gabapentin or [3H]Leucine)



* The source of α2δ-2-6His is that purified by FPLC Superdex-200 gel filtration (see
example 5)

Immediately after adding radioligand, flash plates are loaded in the Packard Top Count
scintillation counter to follow the binding time course. Incubation time before the assay is
3 hours. The '[3H] flash plate' programme (cpm) is used to monitor activity.
Competition studies are compared across the flash-plate and filter binding methodologies
in order to validate the new assay technology with the established filter binding
methodology.
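
The definition of specific binding given in this example (average 'total' minus average 'non-specific') can be written out directly. The following is a minimal sketch; the function name is hypothetical and the cpm values in the usage example are invented purely for illustration, not measured data.

```python
from statistics import mean


def specific_binding(total_cpm, nonspecific_cpm):
    """Specific binding = mean of 'total' wells - mean of 'non-specific' wells."""
    return mean(total_cpm) - mean(nonspecific_cpm)


if __name__ == "__main__":
    # Hypothetical replicate counts (cpm), for illustration only.
    total_wells = [5000, 5200, 4800]
    nonspecific_wells = [100, 120, 80]
    print(specific_binding(total_wells, nonspecific_wells))
```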

GraphPad Prism software is used to process competition curve data and determine IC50 and
Hill slope values. Twelve point competition curves with half log dilution steps of test
compounds are used in the experiments.
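
A twelve point series with half log dilution steps spans 5.5 log units of concentration. The sketch below generates such a series and evaluates a four-parameter logistic curve of the general kind used to describe IC50 and Hill slope. It is an illustration only: the exact model and settings used in GraphPad Prism are not stated in the text, and the function names, the top concentration, and the parameter values are all assumptions.

```python
def half_log_series(top_conc, n_points=12):
    """Concentrations decreasing from top_conc in half-log (10**0.5) steps."""
    return [top_conc * 10 ** (-0.5 * i) for i in range(n_points)]


def four_param_logistic(conc, bottom, top, ic50, hill):
    """Signal remaining at a given competitor concentration.

    With hill > 0 the curve falls from `top` (no competitor)
    to `bottom` (saturating competitor); at conc == ic50 the
    signal is halfway between the two plateaus.
    """
    return bottom + (top - bottom) / (1.0 + (conc / ic50) ** hill)


if __name__ == "__main__":
    # Hypothetical top concentration of 100 µM (1e-4 M).
    for c in half_log_series(1e-4):
        # Hypothetical curve: plateaus 0..100 %, IC50 = 100 nM, Hill slope 1.
        print(f"{c:.3e} M -> {four_param_logistic(c, 0.0, 100.0, 1e-7, 1.0):.1f} %")
```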

Example 10

Filter binding assay of [3H]gabapentin binding to the recombinant polypeptide of
SEQ ID N°23

Assays were carried out at 21°C in a final volume of 250µl in 96-deep well plates. Assay
components were (all reagents were diluted in 10mM HEPES (pH 7.4 at 21°C)):

25µl compound to test

200µl polypeptide of SEQ ID N°23 (3µl protein diluted to 200µl)

25µl radioligand ([3H]gabapentin (65Ci/mmole))

Plates were incubated at room temperature for 1h prior to filtering on to 96-well GF/B
Unifilter plates pre-soaked in 0.3% polyethylenimine. Filters were washed with 3 x 1ml
50mM Tris-HCl (pH 7.4 at 4°C), and dried overnight. Scintillant (Microscint O, 50µl) was
added and the plates counted using a Packard Top Count scintillation counter. Specific
binding was ~98% of the 'total' value. In [3H]gabapentin saturation studies, the Kd
obtained was about 10.62 nM. Table 2 below provides the results of the competition studies.
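
To illustrate how the Kd of about 10.62 nM relates to binding data, the sketch below computes fractional occupancy for a simple one-site model and applies the standard Cheng-Prusoff relationship to convert an IC50 into a Ki. Both are textbook relationships offered as assumptions; the text does not state that either was applied, and the IC50 and radioligand concentration in the usage example are hypothetical.

```python
KD_NM = 10.62  # Kd for [3H]gabapentin reported in the saturation studies


def fractional_occupancy(ligand_nm, kd_nm=KD_NM):
    """One-site binding: fraction of sites occupied = L / (L + Kd)."""
    return ligand_nm / (ligand_nm + kd_nm)


def cheng_prusoff_ki(ic50_nm, radioligand_nm, kd_nm=KD_NM):
    """Cheng-Prusoff: Ki = IC50 / (1 + [L]/Kd), for competitive binding."""
    return ic50_nm / (1.0 + radioligand_nm / kd_nm)


if __name__ == "__main__":
    # At a ligand concentration equal to Kd, half the sites are occupied.
    print(fractional_occupancy(KD_NM))
    # Hypothetical IC50 of 100 nM measured with 10.62 nM radioligand.
    print(cheng_prusoff_ki(100.0, 10.62))
```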



Example 11

Construction of a nucleotide sequence encoding a soluble secreted mouse α2δ-3
deletion mutant of SEQ ID N°25

a) Primer design

PCR primers were designed to generate the secreted soluble mouse α2δ-3 deletion mutant
of SEQ ID N°25 as follows:

5' PCR primer: This was designed to engineer in a KOZAK translation initiation
consensus sequence prior to the coding sequence (Kozak JBC 266 19867-19870).
3' PCR primer: This was designed to engineer in six histidine residues followed by a stop
codon at the desired location in the coding sequence. In addition to the stop codon, the
α2δ-3 primers also included an Eco RI restriction site.

The bold region in each primer sequence denotes the 'tagged' region; addition of
sequences not present in the template. Primers were custom synthesized by Perkin Elmer
Applied Biosystems UK to the ABI ready pure grade, supplied lyophilized then
resuspended to 15nM in 10mM TE. JB201 and JB202 were provided with 5' phosphate
groups:

5' Primer JB201 (5'-tcgccaccatggccgggccgggc-3'; SEQ ID N°27)

3' Primer JB202 (5'-tctcagtgatggtgatggtgatgcgatgcacccccacactctc-3'; SEQ ID N°28)
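
The tagging strategy described for the two primers can be checked in silico: the 5' primer should carry a Kozak-type sequence around the ATG, and the reverse complement of the 3' primer should read as six histidine codons followed by a stop codon. The sketch below uses the primer sequences as printed above (the first characters of JB202 are garbled in the source and are read here as the 5' label, which is an assumption). The motif strings and function name are illustrative choices.

```python
COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(seq):
    """Reverse complement of a lowercase DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


JB201 = "tcgccaccatggccgggccgggc"
JB202 = "tctcagtgatggtgatggtgatgcgatgcacccccacactctc"

# Kozak-type consensus around the start codon (illustrative motif).
KOZAK = "gccaccatgg"
# Six histidine codons (cat/cac) followed by a tga stop codon.
HIS6_STOP = "catcaccatcaccatcac" + "tga"

if __name__ == "__main__":
    sense_strand = reverse_complement(JB202)
    print("Kozak motif in JB201:", KOZAK in JB201)
    print("6His + stop on sense strand of JB202:", HIS6_STOP in sense_strand)
```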
b) Protocol for PCR mediated 5' Kozak and 3' 6His tagging of mouse α2δ-3

The full length mouse α2δ-3 gene (GenBank Accession number AJ010949) in the
pcDNA3 vector (as described in Brown, J.P. and Gee, N.S., Cloning and deletion
mutagenesis of the α2δ calcium channel subunit from porcine cerebral cortex, The Journal
of Biological Chemistry, 273(39):25458-25465) was used as the template in the following
PCR reaction.

The reagents were added in the following order in triplicate to a 96 well PCR plate:

Reagent                               Volume (µl)
10x Pfx Amplification buffer          5
10mM dNTPs                            1.5
50mM MgSO4                            1
15µM JB201                            1.5
15µM JB202                            1.5
100ng/µl pcDNA3-mouse-α2δ-3           1
10x PCR Enhancer                      5
H2O                                   32.7
2.5 units/µl Pfx polymerase           0.8
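
The reaction set-up above can be cross-checked by summing the volumes (which should give a 50µl reaction) and deriving the final concentration of each stock. This is an illustrative sketch; the dictionary keys and helper name are hypothetical, and the reagent names follow the table as read from the source, where several entries are partly garbled.

```python
import math

# Volumes (µl) per reaction, as listed in the table above.
PCR_MIX_UL = {
    "10x amplification buffer": 5.0,
    "10 mM dNTPs": 1.5,
    "50 mM MgSO4": 1.0,
    "15 uM JB201": 1.5,
    "15 uM JB202": 1.5,
    "100 ng/ul template": 1.0,
    "10x PCR enhancer": 5.0,
    "H2O": 32.7,
    "2.5 U/ul polymerase": 0.8,
}


def final_conc(stock, volume_ul, total_ul):
    """Final concentration after dilution into the reaction volume."""
    return stock * volume_ul / total_ul


if __name__ == "__main__":
    total = sum(PCR_MIX_UL.values())
    print("total volume (µl):", round(total, 1))
    print("each primer (µM):", final_conc(15.0, 1.5, total))
    print("MgSO4 (mM):", final_conc(50.0, 1.0, total))
    print("template (ng):", 100.0 * 1.0)
    print("polymerase (units):", 2.5 * 0.8)
```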



The plate was then cycled on an MJ Tetrad DNA engine according to the following cycling
conditions:

94°C / 2mins
followed by:
for 30 cycles
94°C / 45sec
60°C / 45sec
68°C / 4mins
followed by:
68°C / 10mins
followed by:
hold at 4°C
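
The cycling conditions can be represented as a simple program structure, from which the programmed run time follows. This is a sketch for illustration; the data layout and function name are hypothetical, and the result excludes temperature ramping and the final hold.

```python
# Cycling program as (temperature in °C, duration in seconds).
INITIAL_DENATURATION = (94, 2 * 60)
CYCLE_STEPS = [(94, 45), (60, 45), (68, 4 * 60)]
N_CYCLES = 30
FINAL_EXTENSION = (68, 10 * 60)


def programmed_time_s():
    """Total programmed time, ignoring ramp times and the 4°C hold."""
    per_cycle = sum(duration for _, duration in CYCLE_STEPS)
    return INITIAL_DENATURATION[1] + N_CYCLES * per_cycle + FINAL_EXTENSION[1]


if __name__ == "__main__":
    total_s = programmed_time_s()
    print(total_s, "s =", total_s / 60, "min")
```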

The 3244bp product was then gel purified from a 1% TAE agarose gel using QIAEX beads
and eluted in approximately 50µl.

The truncated protein of SEQ ID N°24 was expressed following the procedures of examples 2, 3
and 4.

SEQUENCE LISTING

1 - human nucleotide sequence 

ATGGCGGTGCCGGCTCGGACCTGCGGCGCCTCTCGGCCCGGCCCAGCGCGGACTGCGCGCC 
CCTGGCCCGGCTGCGGCCCCCACCCTGGCCCCGGCACCCGGCGCCCGACGTCCGGGCCCCC
GCGCCCGCTGTGGCTGCTGCTGCCGCTTCTACCGCTGCTCGCCGCCCCCGGCGCCTCTGC
CTACAGCTTCCCCCAGCAGCACACGATGCAGCACTGGGCCCGGCGTCTGGAGCAGGAGGTC 
GACGGCGTGATGCGGATTTTTGGAGGCGTCCAGCAGCTCCGTGAGATTTACAAGGACAACC 
GGAACCTGTTCGAGGTACAGGAGAATGAGCCTCAGAAGTTGGTGGAGAAGGTGGCAGGGGA 

CATTGAGAGCCTTCTGGACAGGAAGGTGCAGGCCCTGAAGAGACTGGCTGATGCTGCAGAG
AACTTCCAGAAAGCACACCGCTGGCAGGACAACATCAAGGAGGAAGACATCGTGTACTATG 
ACGCCAAGGCTGACGCTGAGCTGGACGACCCTGAGAGTGAGGATGTGGAAAGGGGGTCTAA 
GGCCAGCACCCTAAGGCTGGACTTCATCGAGGACCCAAACTTCAAGAACAAGGTCAACTAT 
TCATACGCGGCTGTACAGATCCCTACGGACATCTACAAAGGCTCCACTGTCATCCTCAATG 

AGCTCAACTGGACAGAGGCCCTGGAGAATGTGTTCATGGAAAACCGCAGACAAGACCCCAC
ACTGCTGTGGCAGGTCTTCGGCAGCGCCACAGGAGTCACTCGCTACTACCCGGCCACCCCG 
TGGCGAGCCCCCAAGAAGATCGACCTGTACGATGTCCGAAGGAGACCCTGGTATATCCAGG 
GGGCCTCGTCACCCAAAGACATGGTCATCATCGTGGATGTGAGTGGCAGTGTGAGCGGCCT 
GACCCTGAAGCTGATGAAGACATCTGTCTGCGAGATGCTGGACACGCTGTCTGATGATGAC 

TATGTGAATGTGGCCTCGTTCAACGAGAAGGCACAGCCTGTGTCATGCTTCACACACCTGG
TGCAGGCCAATGTGCGCAACAAGAAGGTGTTCAAGGAAGCTGTGCAGGGCATGGTGGCCAA 
GGGCACCACAGGCTACAAGGCCGGCTTTGAGTATGCCTTTGACCAGCTGCAGAACTCCAAC 
ATCACTCGGGCCAACTGCAACAAGATGATCATGATGTTCACGGATGGTGGTGAGGACCGCG 
TGCAGGACGTCTTTGAGAAGTACAATTGGCCAAACCGGACGGTGCGCGTGTTTACTTTCTC 

CGTGGGGCAGCATAACTATGACGTCACACCGCTGCAGTGGATGGCCTGTGCCAACAAAGGC
TACTATTTTGAGATCCCTTCCATCGGAGCCATCCGCATCAACACACAGGAATATCTAGATG 
TGTTGGGCAGGCCCATGGTGCTGGCAGGCAAGGAGGCCAAGCAGGTTCAGTGGACCAACGT 
GTATGAGGATGCACTGGGACTGGGGTTGGTGGTAACAGGGACCCTCCCTGTTTTCAACCTG 
ACACAGGATGGCCCTGGGGAAAAGAAGAACCAGCTGATCCTGGGCGTGATGGGCATTGACG 

TGGCTCTGAATGACATCAAGAGGCTGACCCCCAACTACACGCTTGGAGCCAACGGCTATGT
GTTTGCCATTGACCTGAACGGCTACGTGTTGCTGCACCCCAATCTCAAGCCCCAGACCACC 
AACTTCCGGGAGCCTGTGACTCTGGACTTCCTGGATGCGGAGCTAGAGGATGAGAACAAGG 
AAGAGATCCGTCGGAGCATGATTGATGGCAACAAGGGCCACAAGCAGATCAGAACGTTGGT 
CAAGTCCCTGGATGAGAGGTACATAGATGAGGTGACACGGAACTACACCTGGGTGCCTATA

AGGAGCACTAACTACAGCCTGGGGCTGGTGCTCCCACCCTACAGCACCTTCTACCTCCAAG
CCAATCTCAGTGACCAGATCCTGCAGGTCAAGTATTTTGAGTTCCTGCTCCCCAGCAGCTT 
TGAGTCTGAAGGACACGTTTTCATTGCTCCCAGAGAGTACTGCAAGGACCTGAATGCCTCA 
GACAACAACACCGAGTTCCTGAAAAACTTTATTGAGCTCATGGAGAAAGTGACTCCAGACT 
CCAAGCAGTGCAACAACTTCCTTCTGCACAACCTGATCTTGGACACGGGCATCACGCAGCA 






GCTGGTAGAGCGTGTGTGGAGGGACCAGGATCTCAACACGTACAGCCTACTGGCCGTGTTC 
GCTGCCACAGACGGTGGCATCACCCGAGTCTTCCCCAACAAGGCAGCTGAGGACTGGACAG 
AGAACCCTGAGCCCTTCAATGCCAGCTTCTACCGCCGCAGCCTGGATAACCACGGTTATGT 
CTTCAAGCCCCCACACCAGGATGCCCTGTTAAGGCCGCTGGAGCTGGAGAATGACACTGTG 

GGCATCCTCGTCAGCACAGCTGTGGAGCTCAGCCTAGGCAGGCGCACACTGAGGCCAGCAG
TGGTGGGCGTCAAGCTGGACCTAGAGGCTTGGGCTGAGAAGTTCAAGGTGCTAGCCAGCAA 
CCGTACCCACCAAGACCAGCCTCAGAAGTGCGGCCCCAACAGCCACTGTGAGATGGACTGC 
GAGGTTAACAATGAGGACTTACTCTGTGTCCTCATTGATGATGGAGGATTCCTGGTGCTGT 
CAAACCAGAACCATCAGTGGGACCAGGTGGGCAGGTTCTTCAGTGAGGTGGATGCCAACCT 

GATGCTGGCACTCTACAATAACTCCTTCTACACCCGCAAGGAGTCCTATGACTATCAGGCA
GCCTGTGCCCCTCAGCCCCCTGGCAACCTGGGTGCTGCACCCCGGGGTGTCTTTGTGCCCA 
CCGTTGCAGATTTCCTTAACCTGGCCTGGTGGACCTCTGCTGCCGCCTGGTCCCTGTTCCA 
GCAGCTTCTCTACGGCCTCATCTACCACAGCTGGTTCCAAGCAGACCCCGCGGAGGCCGAG 
GGGAGCCCCGAGACGCGCGAGAGCAGCTGCGTCATGAAACAGACCCAGTACTACTTCGGCT 

CGGTAAACGCCTCCTACAACGCCATCATCGACTGCGGAAACTGCTCCAGGCTGTTCCACGC
GCAGAGACTGACCAACACCAATCTTCTCTTTGTGGTGGCCGAGAAGCCGCTGTGCAGCCAG 
TGCGAGGCTGGCCGG 

2 - human nucleotide sequence 

ATGGCGGTGCCGGCTCGGACCTGCGGCGCCTCTCGGCCCGGCCCAGCGCGGACTGCGCGCC
CCTGGCCCGGCTGCGGCCCCCACCCTGGCCCCGGCACCCGGCGCCCGACGTCCGGGCCCCC 
GCGCCCGCTGTGGCTGCTGCTGCCGCTTCTACCGCTGCTCGCCGCCCCCGGCGCCTCTGCC 
TACAGCTTCCCCCAGCAGCACACGATGCAGCACTGGGCCCGGCGTCTGGAGCAGGAGGTCG 
ACGGCGTGATGCGGATTTTTGGAGGCGTCCAGCAGCTCCGTGAGATTTACAAGGACAACCG 

GAACCTGTTCGAGGTACAGGAGAATGAGCCTCAGAAGTTGGTGGAGAAGGTGGCAGGGGAC
ATTGAGAGCCTTCTGGACAGGAAGGTGCAGGCCCTGAAGAGACTGGCTGATGCTGCAGAGA 
ACTTCCAGAAAGCACACCGCTGGCAGGACAACATCAAGGAGGAAGACATCGTGTACTATGA 
CGCCAAGGCTGACGCTGAGCTGGACGACCCTGAGAGTGAGGATGTGGAAAGGGGGTCTAAG 
GCCAGCACCCTAAGGCTGGACTTCATCGAGGACCCAAACTTCAAGAACAAGGTCAACTATT 

CATACGCGGCTGTACAGATCCCTACGGACATCTACAAAGGCTCCACTGTCATCCTCAATGA
GCTCAACTGGACAGAGGCCCTGGAGAATGTGTTCATGGAAAACCGCAGACAAGACCCCACA 
CTGCTGTGGCAGGTCTTCGGCAGCGCCACAGGAGTCACTCGCTACTACCCGGCCACCCCGT 
GGCGAGCCCCCAAGAAGATCGACCTGTACGATGTCCGAAGGAGACCCTGGTATATCCAGGG 
GGCCTCGTCACCCAAAGACATGGTCATCATCGTGGATGTGAGTGGCAGTGTGAGCGGCCTG 

ACCCTGAAGCTGATGAAGACATCTGTCTGCGAGATGCTGGACACGCTGTCTGATGATGACT
ATGTGAATGTGGCCTCGTTCAACGAGAAGGCACAGCCTGTGTCATGCTTCACACACCTGGT 
GCAGGCCAATGTGCGCAACAAGAAGGTGTTCAAGGAAGCTGTGCAGGGCATGGTGGCCAAG 
GGCACCACAGGCTACAAGGCCGGCTTTGAGTATGCCTTTGACCAGCTGCAGAACTCCAACA 
TCACTCGGGCCAACTGCAACAAGATGATCATGATGTTCACGGATGGTGGTGAGGACCGCGT
GCAGGACGTCTTTGAGAAGTACAATTGGCCAAACCGGACGGTGCGCGTGTTTACTTTCTCC 
GTGGGGCAGCATAACTATGACGTCACACCGCTGCAGTGGATGGCCTGTGCCAACAAAGGCT 
ACTATTTTGAGATCCCTTCCATCGGAGCCATCCGCATCAACACACAGGAATATCTAGATGT 
GTTGGGCAGGCCCATGGTGCTGGCAGGCAAGGAGGCCAAGCAGGTTCAGTGGACCAACGTG
TATGAGGATGCACTGGGACTGGGGTTGGTGGTAACAGGGACCCTCCCTGTTTTCAACCTGA 
CACAGGATGGCCCTGGGGAAAAGAAGAACCAGCTGATCCTGGGCGTGATGGGCATTGACGT 
GGCTCTGAATGACATCAAGAGGCTGACCCCCAACTACACGCTTGGAGCCAACGGCTATGTG 
TTTGCCATTGACCTGAACGGCTACGTGTTGCTGCACCCCAATCTCAAGCCCCAGACCACCA 

ACTTCCGGGAGCCTGTGACTCTGGACTTCCTGGATGCGGAGCTAGAGGATGAGAACAAGGA
AGAGATCCGTCGGAGCATGATTGATGGCAACAAGGGCCACAAGCAGATCAGAACGTTGGTC 
AAGTCCCTGGATGAGAGGTACATAGATGAGGTGACACGGAACTACACCTGGGTGCCTATAA 
GGAGCACTAACTACAGCCTGGGGCTGGTGCTCCCACCCTACAGCACCTTCTACCTCCAAGC 
CAATCTCAGTGACCAGATCCTGCAGGTCAAGTATTTTGAGTTCCTGCTCCCCAGCAGCTTT 

GAGTCTGAAGGACACGTTTTCATTGCTCCCAGAGAGTACTGCAAGGACCTGAATGCCTCAG
ACAACAACACCGAGTTCCTGAAAAACTTTATTGAGCTCATGGAGAAAGTGACTCCAGACTC 
CAAGCAGTGCAACAACTTCCTTCTGCACAACCTGATCTTGGACACGGGCATCACGCAGCAG 
CTGGTAGAGCGTGTGTGGAGGGACCAGGATCTCAACACGTACAGCCTACTGGCCGTGTTCG 
CTGCCACAGACGGTGGCATCACCCGAGTCTTCCCCAACAAGGCAGCTGAGGACTGGACAGA 

GAACCCTGAGCCCTTCAATGCCAGCTTCTACCGCCGCAGCCTGGATAACCACGGTTATGTC
TTCAAGCCCCCACACCAGGATGCCCTGTTAAGGCCGCTGGAGCTGGAGAATGACACTGTGG 
GCATCCTCGTCAGCACAGCTGTGGAGCTCAGCCTAGGCAGGCGCACACTGAGGCCAGCAGT 
GGTGGGCGTCAAGCTGGACCTAGAGGCTTGGGCTGAGAAGTTCAAGGTGCTAGCCAGCAAC 
CGTACCCACCAAGACCAGCCTCAGAAGTGCGGCCCCAACAGCCACTGTGAGATGGACTGCG 

AGGTTAACAATGAGGACTTACTCTGTGTCCTCATTGATGATGGAGGATTCCTGGTGCTGTC
AAACCAGAACCATCAGTGGGACCAGGTGGGCAGGTTCTTCAGTGAGGTGGATGCCAACCTG 
ATGCTGGCACTCTACAATAACTCCTTCTACACCCGCAAGGAGTCCTATGACTATCAGGCAG 
CCTGTGCCCCTCAGCCCCCTGGCAACCTGGGTGCTGCACCCCGGGGTGTCTTTGTGCCCAC 
CGTTGCAGATTTCCTTAACCTGGCCTGGTGGACCTCTGCTGCCGCCTGGTCCCTGTTCCAG 

CAGCTTCTCTACGGCCTCATCTACCACAGCTGGTTCCAAGCAGACCCCGCGGAGGCCGAGG
GGAGCCCCGAGACGCGCGAGAGCAGCTGCGTCATGAAACAGACCCAGTACTACTTCGGCTC 
GGTAAACGCCTCCTACAACGCCATCATCGACTGCGGAAACTGCTCCAGGCTGTTCCACGCG 
CAGAGACTGACCAACACCAATCTTCTCTTTGTGGTGGCCGAGAAGCCGCTGTGCAGCCAGT 
GCGAGGCTGGCCGGCTGCTGCAGAAGGAGACGCACTGCCCAGCGGACGGCCCGGAGCAGTG 

TGAGCTAGTGCAGAG

3 - human nucleotide sequence

ATGGCGGTGCCGGCTCGGACCTGCGGCGCCTCTCGGCCCGGCCCAGCGCGGACTGCGCGCC 
CCTGGCCCGGCTGCGGCCCCCACCCTGGCCCCGGCACCCGGCGCCCGACGTCCGGGCCCCC 
GCGCCCGCTGTGGCTGCTGCTGCCGCTTCTACCGCTGCTCGCCGCCCCCGGCGCCTCTGCC 
TACAGCTTCCCCCAGCAGCACACGATGCAGCACTGGGCCCGGCGTCTGGAGCAGGAGGTCG
ACGGCGTGATGCGGATTTTTGGAGGCGTCCAGCAGCTCCGTGAGATTTACAAGGACAACCG 
GAACCTGTTCGAGGTACAGGAGAATGAGCCTCAGAAGTTGGTGGAGAAGGTGGCAGGGGAC 
ATTGAGAGCCTTCTGGACAGGAAGGTGCAGGCCCTGAAGAGACTGGCTGATGCTGCAGAGA
ACTTCCAGAAAGCACACCGCTGGCAGGACAACATCAAGGAGGAAGACATCGTGTACTATGA 

CGCCAAGGCTGACGCTGAGCTGGACGACCCTGAGAGTGAGGATGTGGAAAGGGGGTCTAAG
GCCAGCACCCTAAGGCTGGACTTCATCGAGGACCCAAACTTCAAGAACAAGGTCAACTATT 
CATACGCGGCTGTACAGATCCCTACGGACATCTACAAAGGCTCCACTGTCATCCTCAATGA 
GCTCAACTGGACAGAGGCCCTGGAGAATGTGTTCATGGAAAACCGCAGACAAGACCCCACA 
CTGCTGTGGCAGGTCTTCGGCAGCGCCACAGGAGTCACTCGCTACTACCCGGCCACCCCGT 

GGCGAGCCCCCAAGAAGATCGACCTGTACGATGTCCGAAGGAGACCCTGGTATATCCAGGG
GGCCTCGTCACCCAAAGACATGGTCATCATCGTGGATGTGAGTGGCAGTGTGAGCGGCCTG 
ACCCTGAAGCTGATGAAGACATCTGTCTGCGAGATGCTGGACACGCTGTCTGATGATGACT 
ATGTGAATGTGGCCTCGTTCAACGAGAAGGCACAGCCTGTGTCATGCTTCACACACCTGGT 
GCAGGCCAATGTGCGCAACAAGAAGGTGTTCAAGGAAGCTGTGCAGGGCATGGTGGCCAAG 

GGCACCACAGGCTACAAGGCCGGCTTTGAGTATGCCTTTGACCAGCTGCAGAACTCCAACA
TCACTCGGGCCAACTGCAACAAGATGATCATGATGTTCACGGATGGTGGTGAGGACCGCGT 
GCAGGACGTCTTTGAGAAGTACAATTGGCCAAACCGGACGGTGCGCGTGTTTACTTTCTCC 
GTGGGGCAGCATAACTATGACGTCACACCGCTGCAGTGGATGGCCTGTGCCAACAAAGGCT 
ACTATTTTGAGATCCCTTCCATCGGAGCCATCCGCATCAACACACAGGAATATCTAGATGT 

GTTGGGCAGGCCCATGGTGCTGGCAGGCAAGGAGGCCAAGCAGGTTCAGTGGACCAACGTG
TATGAGGATGCACTGGGACTGGGGTTGGTGGTAACAGGGACCCTCCCTGTTTTCAACCTGA 
CACAGGATGGCCCTGGGGAAAAGAAGAACCAGCTGATCCTGGGCGTGATGGGCATTGACGT 
GGCTCTGAATGACATCAAGAGGCTGACCCCCAACTACACGCTTGGAGCCAACGGCTATGTG 
TTTGCCATTGACCTGAACGGCTACGTGTTGCTGCACCCCAATCTCAAGCCCCAGACCACCA 

ACTTCCGGGAGCCTGTGACTCTGGACTTCCTGGATGCGGAGCTAGAGGATGAGAACAAGGA
AGAGATCCGTCGGAGCATGATTGATGGCAACAAGGGCCACAAGCAGATCAGAACGTTGGTC 
AAGTCCCTGGATGAGAGGTACATAGATGAGGTGACACGGAACTACACCTGGGTGCCTATAA 
GGAGCACTAACTACAGCCTGGGGCTGGTGCTCCCACCCTACAGCACCTTCTACCTCCAAGC 
CAATCTCAGTGACCAGATCCTGCAGGTCAAGTATTTTGAGTTCCTGCTCCCCAGCAGCTTT 

GAGTCTGAAGGACACGTTTTCATTGCTCCCAGAGAGTACTGCAAGGACCTGAATGCCTCAG
ACAACAACACCGAGTTCCTGAAAAACTTTATTGAGCTCATGGAGAAAGTGACTCCAGACTC 
CAAGCAGTGCAACAACTTCCTTCTGCACAACCTGATCTTGGACACGGGCATCACGCAGCAG 
CTGGTAGAGCGTGTGTGGAGGGACCAGGATCTCAACACGTACAGCCTACTGGCCGTGTTCG 
CTGCCACAGACGGTGGCATCACCCGAGTCTTCCCCAACAAGGCAGCTGAGGACTGGACAGA
GAACCCTGAGCCCTTCAATGCCAGCTTCTACCGCCGCAGCCTGGATAACCACGGTTATGTC 
TTCAAGCCCCCACACCAGGATGCCCTGTTAAGGCCGCTGGAGCTGGAGAATGACACTGTGG 
GCATCCTCGTCAGCACAGCTGTGGAGCTCAGCCTAGGCAGGCGCACACTGAGGCCAGCAGT 
GGTGGGCGTCAAGCTGGACCTAGAGGCTTGGGCTGAGAAGTTCAAGGTGCTAGCCAGCAAC
CGTACCCACCAAGACCAGCCTCAGAAGTGCGGCCCCAACAGCCACTGTGAGATGGACTGCG 
AGGTTAACAATGAGGACTTACTCTGTGTCCTCATTGATGATGGAGGATTCCTGGTGCTGTC 
AAACCAGAACCATCAGTGGGACCAGGTGGGCAGGTTCTTCAGTGAGGTGGATGCCAACCTG 
ATGCTGGCACTCTACAATAACTCCTTCTACACCCGCAAGGAGTCCTATGACTATCAGGCAG 

CCTGTGCCCCTCAGCCCCCTGGCAACCTGGGTGCTGCACCCCGGGGTGTCTTTGTGCCCAC
CGTTGCAGATTTCCTTAACCTGGCCTGGTGGACCTCTGCTGCCGCCTGGTCCCTGTTCCAG 
CAGCTTCTCTACGGCCTCATCTACCACAGCTGGTTCCAAGCAGACCCCGCGGAGGCCGAGG 
GGAGCCCCGAGACGCGCGAGAGCAGCTGCGTCATGAAACAGACCCAGTACTACTTCGGCTC 
GGTAAACGCCTCCTACAACGCCATCATCGACTGCGGAAACTGCTCCAGGCTGTTCCACGCG 

CAGAGACTGACCAACACCAATCTTCTCTTTGTGGTGGCCGAGAAGCCGCTGTGCAGCCAGT
GCGAGGCTGGCCGGCTGCTGCAGAAGGAGACGCACTGCCCAGCGGACGGCCCGGAGCAGTG 
TGAGCTAGTGCAGAGACCGCGATACCGGAGAGGCCCGCACATCTGCTTCGACTACAACGCG 
ACAGAAGATACCTCAGACTGTGGCCGCGGGGCC 

4 - human amino acid sequence

MAVPARTCGASRPGPARTARPWPGCGPHPGPGTRRPTSGPPRPLWLLLPLLPLLAAPGASA 
YSFPQQHTMQHWARRLEQEVDGVMRIFGGVQQLREIYKDNRNLFEVQENEPQKLVEKVAGD 
IESLLDRKVQALKRLADAAENFQKAHRWQDNIKEEDIVYYDAKADAELDDPESEDVERGSK 
ASTLRLDFIEDPNFKNKVNYSYAAVQIPTDIYKGSTVILNELNWTEALENVFMENRRQDPT 

LLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGL
TLKLMKTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAK 
GTTGYKAGFEYAFDQLQNSNITRANCNKMIMMFTDGGEDRVQDVFEKYNWPNRTVRVFTFS 
VGQHNYDVTPLQWMACANKGYYFEIPSIGAIRINTQEYLDVLGRPMVLAGKEAKQVQWTNV
YEDALGLGLVVTGTLPVFNLTQDGPGEKKNQLILGVMGIDVALNDIKRLTPNYTLGANGYV

FAIDLNGYVLLHPNLKPQTTNFREPVTLDFLDAELEDENKEEIRRSMIDGNKGHKQIRTLV
KSLDERYIDEVTRNYTWVPIRSTNYSLGLVLPPYSTFYLQANLSDQILQVKYFEFLLPSSF
ESEGHVFIAPREYCKDLNASDNNTEFLKNFIELMEKVTPDSKQCNNFLLHNLILDTGITQQ 
LVERVWRDQDLNTYSLLAVFAATDGGITRVFPNKAAEDWTENPEPFNASFYRRSLDNHGYV
FKPPHQDALLRPLELENDTVGILVSTAVELSLGRRTLRPAVVGVKLDLEAWAEKFKVLASN

RTHQDQPQKCGPNSHCEMDCEVNNEDLLCVLIDDGGFLVLSNQNHQWDQVGRFFSEVDANL
MLALYNNSFYTRKESYDYQAACAPQPPGNLGAAPRGVFVPTVADFLNLAWWTSAAAWSLFQ 
QLLYGLIYHSWFQADPAEAEGSPETRESSCVMKQTQYYFGSVNASYNAIIDCGNCSRLFHA
QRLTNTNLLFVVAEKPLCSQCEAGR

5 - human amino acid sequence

MAVPARTCGASRPGPARTARPWPGCGPHPGPGTRRPTSGPPRPLWLLLPLLPLLAAPGASA 
YSFPQQHTMQHWARRLEQEVDGVMRIFGGVQQLREIYKDNRNLFEVQENEPQKLVEKVAGD 
IESLLDRKVQALKRLADAAENFQKAHRWQDNIKEEDIVYYDAKADAELDDPESEDVERGSK
ASTLRLDFIEDPNFKNKVNYSYAAVQIPTDIYKGSTVILNELNWTEALENVFMENRRQDPT
LLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGL
TLKLMKTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAK
GTTGYKAGFEYAFDQLQNSNITRANCNKMIMMFTDGGEDRVQDVFEKYNWPNRTVRVFTFS
VGQHNYDVTPLQWMACANKGYYFEIPSIGAIRINTQEYLDVLGRPMVLAGKEAKQVQWTNV

YEDALGLGLVVTGTLPVFNLTQDGPGEKKNQLILGVMGIDVALNDIKRLTPNYTLGANGYV
FAIDLNGYVLLHPNLKPQTTNFREPVTLDFLDAELEDENKEEIRRSMIDGNKGHKQIRTLV 
KSLDERYIDEVTRNYTWVPIRSTNYSLGLVLPPYSTFYLQANLSDQILQVKYFEFLLPSSF 
ESEGHVFIAPREYCKDLNASDNNTEFLKNFIELMEKVTPDSKQCNNFLLHNLILDTGITQQ 
LVERVWRDQDLNTYSLLAVFAATDGGITRVFPNKAAEDWTENPEPFNASFYRRSLDNHGYV 

FKPPHQDALLRPLELENDTVGILVSTAVELSLGRRTLRPAVVGVKLDLEAWAEKFKVLASN
RTHQDQPQKCGPNSHCEMDCEVNNEDLLCVLIDDGGFLVLSNQNHQWDQVGRFFSEVDANL 
MLALYNNSFYTRKESYDYQAACAPQPPGNLGAAPRGVFVPTVADFLNLAWWTSAAAWSLFQ
QLLYGLIYHSWFQADPAEAEGSPETRESSCVMKQTQYYFGSVNASYNAIIDCGNCSRLFHA
QRLTNTNLLFVVAEKPLCSQCEAGRLLQKETHCPADGPEQCELVQ


6 - human amino acid sequence 

MAVPARTCGASRPGPARTARPWPGCGPHPGPGTRRPTSGPPRPLWLLLPLLPLLAAPGASA 
YSFPQQHTMQHWARRLEQEVDGVMRIFGGVQQLREIYKDNRNLFEVQENEPQKLVEKVAGD 
IESLLDRKVQALKRLADAAENFQKAHRWQDNIKEEDIVYYDAKADAELDDPESEDVERGSK 

ASTLRLDFIEDPNFKNKVNYSYAAVQIPTDIYKGSTVILNELNWTEALENVFMENRRQDPT
LLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGL
TLKLMKTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAK 
GTTGYKAGFEYAFDQLQNSNITRANCNKMIMMFTDGGEDRVQDVFEKYNWPNRTVRVFTFS 
VGQHNYDVTPLQWMACANKGYYFEIPSIGAIRINTQEYLDVLGRPMVLAGKEAKQVQWTNV 

YEDALGLGLVVTGTLPVFNLTQDGPGEKKNQLILGVMGIDVALNDIKRLTPNYTLGANGYV
FAIDLNGYVLLHPNLKPQTTNFREPVTLDFLDAELEDENKEEIRRSMIDGNKGHKQIRTLV 
KSLDERYIDEVTRNYTWVPIRSTNYSLGLVLPPYSTFYLQANLSDQILQVKYFEFLLPSSF 
ESEGHVFIAPREYCKDLNASDNNTEFLKNFIELMEKVTPDSKQCNNFLLHNLILDTGITQQ 
LVERVWRDQDLNTYSLLAVFAATDGGITRVFPNKAAEDWTENPEPFNASFYRRSLDNHGYV

FKPPHQDALLRPLELENDTVGILVSTAVELSLGRRTLRPAVVGVKLDLEAWAEKFKVLASN
RTHQDQPQKCGPNSHCEMDCEVNNEDLLCVLIDDGGFLVLSNQNHQWDQVGRFFSEVDANL 
MLALYNNSFYTRKESYDYQAACAPQPPGNLGAAPRGVFVPTVADFLNLAWWTSAAAWSLFQ
QLLYGLIYHSWFQADPAEAEGSPETRESSCVMKQTQYYFGSVNASYNAIIDCGNCSRLFHA 
QRLTNTNLLFVVAEKPLCSQCEAGRLLQKETHCPADGPEQCELVQRPRYRRGPHICFDYNA
TEDTSDCGRGA 

7 - human nucleotide sequence 

ATGGCCGGGCCGGGCTCGCCGCGCCGCGCGTCCCGGGGGGCCTCGGCGCTTCTCGCTGCCG
CGCTTCTCTACGCCGCGCTGGGGGACGTGGTGCGCTCGGAGCAGCAGATACCGCTCTCCGT 
GGTGAAGCTCTGGGCCTCGGCTTTTGGTGGGGAGATAAAATCCATTGCTGCTAAGTACTCC 
GGTTCCCAGCTTCTGCAAAAGAAATACAAAGAGTATGAGAAAGACGTTGCCATAGAAGAAA 
TTGATGGCCTCCAACTGGTAAAGAAGCTGGCAAAGAACATGGAAGAGATGTTTCACAAGAA 

GTCTGAGGCCGTCAGGCGTCTGGTGGAGGCTGCAGAAGAAGCACACCTGAAACATGAATTT
GATGCAGACTTACAGTATGAATACTTCAATGCTGTGCTGATAAATGAAAGGGACAAAGACG
GGAATTTTTTGGAGCTGGGAAAGGAATTCATCTTAGCCCCAAATGACCATTTTAATAATTT 
GCCTGTGAACATCAGTCTAAGTGACGTCCAAGTACCAACGAACATGTACAACAAAGACCCT 
GCAATTGTCAATGGGGTTTATTGGTCTGAATCTCTAAACAAAGTTTTTGTAGATAACTTTG 

ACCGTGACCCATCTCTCATATGGCAGTACTTTGGAAGTGCAAAGGGCTTTTTTAGGCAGTA
TCCGGGGATTAAATGGGAACCAGATGAGAATGGAGTCATTGCCTTCGACTGCAGGAACCGA 
AAATGGTACATCCAGGCAGCAACTTCTCCGAAAGACGTGGTCATTTTAGTTGACGTCAGTG 
GCAGCATGAAAGGACTCCGTCTGACTATCGCGAAGCAAACAGTCTCATCCATTTTGGATAC 
ACTTGGGGATGATGACTTCTTCAACATAATTGCTTATAATGAGGAGCTTCACTATGTGGAA 

CCTTGCCTGAATGGAACTTTGGTGCAAGCCGACAGGACAAACAAAGAGCACTTCAGGGAGC
ATCTGGACAAACTTTTCGCCAAAGGAATTGGAATGTTGGATATAGCTCTGAATGAGGCCTT 
CAACATTCTGAGTGATTTCAACCACACGGGACAAGGAAGTATCTGCAGTCAGGCCATCATG 
CTCATAACTGATGGGGCGGTGGACACCTATGATACAATCTTTGCAAAATACAATTGGCCAG 
ATCGAAAGGTTCGCATCTTCACATACCTCATTGGACGAGAGGCTGCGTTTGCAGACAATCT 

AAAGTGGATGGCCTGTGCCAACAAAGGATTTTTTACCCAGATCTCCACCTTGGCTGATGTG
CAGGAGAATGTCATGGAATACCTTCACGTGCTTAGCCGGCCCAAAGTCATCGACCAGGAGC 
ATGATGTGGTGTGGACCGAAGCTTACATTGACAGCACTCTGACTGATGATCAGGGCCCCGT 
CCTGATGACCACTGTAGCCATGCCTGTGTTTAGTAAGCAGAACGAAACCAGATCGAAGGGC 
ATTCTTCTGGGAGTGGTTGGCACAGATGTCCCAGTGAAAGAACTTCTGAAGACCATCCCCA 

AATACAAGTTAGGGATTCACGGTTATGCCTTTGCAATCACAAATAATGGRTATATCCTGAC
GCATCCGGAACTCAGGCTGCTGTACGAAGAAGGAAAAAAGCGAAGGAAACCTAACTATAGT 
AGCGTTGACCTCTCTGAGGTGGAGTGGGAAGACCGAGATGACGTGTTGAGAAATGCTATGG 
TGAATCGAAAGACGGGGAAGTTTTCCATGGAGGTGAAGAAGACAGTGGACAAAGGGAAACG 
GGTTTTGGTGATGACAAATGACTACTATTATACAGACATCAAGGGTACTCCTTTCAGTTTA 

GGTGTGGCGCTTTCCAGAGGTCATGGGAAATATTTCTTCCGAGGGAATGTAACCATCGAAG
AAGGCCTGCATGACTTAGAACATCCCGATGTGTCCTTGGCAGATGAATGGTCCTACTGCAA 
CACTGACCTACACCCTGAGCACCGCCATCTGTCTCAGTTAGAAGCGATTAAGCTCTACCTA 
AAAGGCAAAGAACCTCTGCTCCAGTGTGATAAAGAATTGATCCAAGAAGTCCTTTTTGACG 
CGGTGGTGAGTGCCCCCATTGAAGCGTATTGGACCAGCCTGGCCCTCAACAAATCTGAAAA
TTCTGACAAGGGCGTGGAGGTTGCCTTCCTCGGCACTCGCACGGGCCTCTCCAGAATCAAC 
CTGTTTGTCGGGGCTGAGCAGCTCACCAATCAGGACTTCCTGAAAGCTGGCGACAAGGAGA 
ACATTTTTAACGCAGACCATTTCCCTCTCTGGTACCGAAGAGCCGCTGAGCAGATTCCAGG 
GAGCTTCGTCTACTCGATCCCATTCAGCACTGGACCAGTCAATAAAAGCAATGTGGTGACA
GCAAGTACATCCATCCAGCTCCTGGATGAACGGAAATCTCCTGTGGTGGCAGCTGTAGGCA 
TTCAGATGAAACTTGAATTTTTCCAAAGGAAGTTCTGGACTGCCAGCAGACAGTGTGCTTC 
CCTGGATGGCAAATGCTCCATCAGCTGTGATGATGAGACTGTGAATTGTTACCTCATAGAC
AATAATGGATTTATTTTGGTGTCTGAAGACTACACACAGACTGGAGACTTTTTTGGTGAGA 

TCGAGGGAGCTGTGATGAACAAATTGCTAACAATGGGCTCCTTTAAAAGAATTACCCTTTA
TGACTACCAAGCCATGTGTAGAGCCAACAAGGAAAGCAGCGATGGCGCCCATGGCCTCCTG 
GATCCTTATAATGCCTTCCTCTCTGCAGTAAAATGGATCATGACAGAACTTGTCTTGTTCC 
TGGTGGAATTTAACCTCTGCAGTTGGTGGCACTCCGATATGACAGCTAAAGCCCAGAAATT 
GAAACAGACCCTGGAGCCTTGTGATACTGAATATCCAGCATTCGTCTCTGAGCGCACCATC 

AAGGAGACTACAGGGAATATTGCTTGTGAAGACTGCTCCAAGTCCTTTGTCATCCAGCAAA
TCCCAAGCAGCAACCTGTTCATGGTGGTGGTGGACAGCAGCTGCCTCTGTGAATCTGTGGC 
CCCCATC 

8 - human nucleotide sequence 

ATGGCCGGGCCGGGCTCGCCGCGCCGCGCGTCCCGGGGGGCCTCGGCGCTTCTCGCTGCCG
CGCTTCTCTACGCCGCGCTGGGGGACGTGGTGCGCTCGGAGCAGCAGATACCGCTCTCCGT 
GGTGAAGCTCTGGGCCTCGGCTTTTGGTGGGGAGATAAAATCCATTGCTGCTAAGTACTCC 
GGTTCCCAGCTTCTGCAAAAGAAATACAAAGAGTATGAGAAAGACGTTGCCATAGAAGAAA 
TTGATGGCCTCCAACTGGTAAAGAAGCTGGCAAAGAACATGGAAGAGATGTTTCACAAGAA 

GTCTGAGGCCGTCAGGCGTCTGGTGGAGGCTGCAGAAGAAGCACACCTGAAACATGAATTT
GATGCAGACTTACAGTATGAATACTTCAATGCTGTGCTGATAAATGAAAGGGACAAAGACG 
GGAATTTTTTGGAGCTGGGAAAGGAATTCATCTTAGCCCCAAATGACCATTTTAATAATTT 
GCCTGTGAACATCAGTCTAAGTGACGTCCAAGTACCAACGAACATGTACAACAAAGACCCT 
GCAATTGTCAATGGGGTTTATTGGTCTGAATCTCTAAACAAAGTTTTTGTAGATAACTTTG 

ACCGTGACCCATCTCTCATATGGCAGTACTTTGGAAGTGCAAAGGGCTTTTTTAGGCAGTA
TCCGGGGATTAAATGGGAACCAGATGAGAATGGAGTCATTGCCTTCGACTGCAGGAACCGA 
AAATGGTACATCCAGGCAGCAACTTCTCCGAAAGACGTGGTCATTTTAGTTGACGTCAGTG 
GCAGCATGAAAGGACTCCGTCTGACTATCGCGAAGCAAACAGTCTCATCCATTTTGGATAC 
ACTTGGGGATGATGACTTCTTCAACATAATTGCTTATAATGAGGAGCTTCACTATGTGGAA 

CCTTGCCTGAATGGAACTTTGGTGCAAGCCGACAGGACAAACAAAGAGCACTTCAGGGAGC
ATCTGGACAAACTTTTCGCCAAAGGAATTGGAATGTTGGATATAGCTCTGAATGAGGCCTT 
CAACATTCTGAGTGATTTCAACCACACGGGACAAGGAAGTATCTGCAGTCAGGCCATCATG 
CTCATAACTGATGGGGCGGTGGACACCTATGATACAATCTTTGCAAAATACAATTGGCCAG 
ATCGAAAGGTTCGCATCTTCACATACCTCATTGGACGAGAGGCTGCGTTTGCAGACAATCT
AAAGTGGATGGCCTGTGCCAACAAAGGATTTTTTACCCAGATCTCCACCTTGGCTGATGTG 
CAGGAGAATGTCATGGAATACCTTCACGTGCTTAGCCGGCCCAAAGTCATCGACCAGGAGC 
ATGATGTGGTGTGGACCGAAGCTTACATTGACAGCACTCTGACTGATGATCAGGGCCCCGT 
CCTGATGACCACTGTAGCCATGCCTGTGTTTAGTAAGCAGAACGAAACCAGATCGAAGGGC
ATTCTTCTGGGAGTGGTTGGCACAGATGTCCCAGTGAAAGAACTTCTGAAGACCATCCCCA 
AATACAAGTTAGGGATTCACGGTTATGCCTTTGCAATCACAAATAATGGRTATATCCTGAC 
GCATCCGGAACTCAGGCTGCTGTACGAAGAAGGAAAAAAGCGAAGGAAACCTAACTATAGT
AGCGTTGACCTCTCTGAGGTGGAGTGGGAAGACCGAGATGACGTGTTGAGAAATGCTATGG 

TGAATCGAAAGACGGGGAAGTTTTCCATGGAGGTGAAGAAGACAGTGGACAAAGGGAAACG
GGTTTTGGTGATGACAAATGACTACTATTATACAGACATCAAGGGTACTCCTTTCAGTTTA 
GGTGTGGCGCTTTCCAGAGGTCATGGGAAATATTTCTTCCGAGGGAATGTAACCATCGAAG 
AAGGCCTGCATGACTTAGAACATCCCGATGTGTCCTTGGCAGATGAATGGTCCTACTGCAA 
CACTGACCTACACCCTGAGCACCGCCATCTGTCTCAGTTAGAAGCGATTAAGCTCTACCTA 

AAAGGCAAAGAACCTCTGCTCCAGTGTGATAAAGAATTGATCCAAGAAGTCCTTTTTGACG
CGGTGGTGAGTGCCCCCATTGAAGCGTATTGGACCAGCCTGGCCCTCAACAAATCTGAAAA 
TTCTGACAAGGGCGTGGAGGTTGCCTTCCTCGGCACTCGCACGGGCCTCTCCAGAATCAAC 
CTGTTTGTCGGGGCTGAGCAGCTCACCAATCAGGACTTCCTGAAAGCTGGCGACAAGGAGA 
ACATTTTTAACGCAGACCATTTCCCTCTCTGGTACCGAAGAGCCGCTGAGCAGATTCCAGG 

GAGCTTCGTCTACTCGATCCCATTCAGCACTGGACCAGTCAATAAAAGCAATGTGGTGACA
GCAAGTACATCCATCCAGCTCCTGGATGAACGGAAATCTCCTGTGGTGGCAGCTGTAGGCA 
TTCAGATGAAACTTGAATTTTTCCAAAGGAAGTTCTGGACTGCCAGCAGACAGTGTGCTTC 
CCTGGATGGCAAATGCTCCATCAGCTGTGATGATGAGACTGTGAATTGTTACCTCATAGAC 
AATAATGGATTTATTTTGGTGTCTGAAGACTACACACAGACTGGAGACTTTTTTGGTGAGA 

TCGAGGGAGCTGTGATGAACAAATTGCTAACAATGGGCTCCTTTAAAAGAATTACCCTTTA
TGACTACCAAGCCATGTGTAGAGCCAACAAGGAAAGCAGCGATGGCGCCCATGGCCTCCTG 
GATCCTTATAATGCCTTCCTCTCTGCAGTAAAATGGATCATGACAGAACTTGTCTTGTTCC 
TGGTGGAATTTAACCTCTGCAGTTGGTGGCACTCCGATATGACAGCTAAAGCCCAGAAATT 
GAAACAGACCCTGGAGCCTTGTGATACTGAATATCCAGCATTCGTCTCTGAGCGCACCATC 

AAGGAGACTACAGGGAATATTGCTTGTGAAGACTGCTCCAAGTCCTTTGTCATCCAGCAAA
TCCCAAGCAGCAACCTGTTCATGGTGGTGGTGGACAGCAGCTGCCTCTGTGAATCTGTGGC 
CCCCATCACCATGGCACCCATTGAAATCAGGTATAATGAATCCCTTAAGTGTGAACGTCTA 
AAG 

9 - human nucleotide sequence

ATGGCCGGGCCGGGCTCGCCGCGCCGCGCGTCCCGGGGGGCCTCGGCGCTTCTCGCTGCCG 
CGCTTCTCTACGCCGCGCTGGGGGACGTGGTGCGCTCGGAGCAGCAGATACCGCTCTCCGT 
GGTGAAGCTCTGGGCCTCGGCTTTTGGTGGGGAGATAAAATCCATTGCTGCTAAGTACTCC 
GGTTCCCAGCTTCTGCAAAAGAAATACAAAGAGTATGAGAAAGACGTTGCCATAGAAGAAA
TTGATGGCCTCCAACTGGTAAAGAAGCTGGCAAAGAACATGGAAGAGATGTTTCACAAGAA 
GTCTGAGGCCGTCAGGCGTCTGGTGGAGGCTGCAGAAGAAGCACACCTGAAACATGAATTT 
GATGCAGACTTACAGTATGAATACTTCAATGCTGTGCTGATAAATGAAAGGGACAAAGACG 
GGAATTTTTTGGAGCTGGGAAAGGAATTCATCTTAGCCCCAAATGACCATTTTAATAATTT
GCCTGTGAACATCAGTCTAAGTGACGTCCAAGTACCAACGAACATGTACAACAAAGACCCT 
GCAATTGTCAATGGGGTTTATTGGTCTGAATCTCTAAACAAAGTTTTTGTAGATAACTTTG 
ACCGTGACCCATCTCTCATATGGCAGTACTTTGGAAGTGCAAAGGGCTTTTTTAGGCAGTA 
TCCGGGGATTAAATGGGAACCAGATGAGAATGGAGTCATTGCCTTCGACTGCAGGAACCGA 

AAATGGTACATCCAGGCAGCAACTTCTCCGAAAGACGTGGTCATTTTAGTTGACGTCAGTG
GCAGCATGAAAGGACTCCGTCTGACTATCGCGAAGCAAACAGTCTCATCCATTTTGGATAC 
ACTTGGGGATGATGACTTCTTCAACATAATTGCTTATAATGAGGAGCTTCACTATGTGGAA 
CCTTGCCTGAATGGAACTTTGGTGCAAGCCGACAGGACAAACAAAGAGCACTTCAGGGAGC 
ATCTGGACAAACTTTTCGCCAAAGGAATTGGAATGTTGGATATAGCTCTGAATGAGGCCTT 

CAACATTCTGAGTGATTTCAACCACACGGGACAAGGAAGTATCTGCAGTCAGGCCATCATG
CTCATAACTGATGGGGCGGTGGACACCTATGATACAATCTTTGCAAAATACAATTGGCCAG 
ATCGAAAGGTTCGCATCTTCACATACCTCATTGGACGAGAGGCTGCGTTTGCAGACAATCT 
AAAGTGGATGGCCTGTGCCAACAAAGGATTTTTTACCCAGATCTCCACCTTGGCTGATGTG 
CAGGAGAATGTCATGGAATACCTTCACGTGCTTAGCCGGCCCAAAGTCATCGACCAGGAGC 

ATGATGTGGTGTGGACCGAAGCTTACATTGACAGCACTCTGACTGATGATCAGGGCCCCGT
CCTGATGACCACTGTAGCCATGCCTGTGTTTAGTAAGCAGAACGAAACCAGATCGAAGGGC 
ATTCTTCTGGGAGTGGTTGGCACAGATGTCCCAGTGAAAGAACTTCTGAAGACCATCCCCA 
AATACAAGTTAGGGATTCACGGTTATGCCTTTGCAATCACAAATAATGGRTATATCCTGAC 
GCATCCGGAACTCAGGCTGCTGTACGAAGAAGGAAAAAAGCGAAGGAAACCTAACTATAGT 

AGCGTTGACCTCTCTGAGGTGGAGTGGGAAGACCGAGATGACGTGTTGAGAAATGCTATGG
TGAATCGAAAGACGGGGAAGTTTTCCATGGAGGTGAAGAAGACAGTGGACAAAGGGAAACG 
GGTTTTGGTGATGACAAATGACTACTATTATACAGACATCAAGGGTACTCCTTTCAGTTTA 
GGTGTGGCGCTTTCCAGAGGTCATGGGAAATATTTCTTCCGAGGGAATGTAACCATCGAAG 
AAGGCCTGCATGACTTAGAACATCCCGATGTGTCCTTGGCAGATGAATGGTCCTACTGCAA 

CACTGACCTACACCCTGAGCACCGCCATCTGTCTCAGTTAGAAGCGATTAAGCTCTACCTA
AAAGGCAAAGAACCTCTGCTCCAGTGTGATAAAGAATTGATCCAAGAAGTCCTTTTTGACG 
CGGTGGTGAGTGCCCCCATTGAAGCGTATTGGACCAGCCTGGCCCTCAACAAATCTGAAAA 
TTCTGACAAGGGCGTGGAGGTTGCCTTCCTCGGCACTCGCACGGGCCTCTCCAGAATCAAC 
CTGTTTGTCGGGGCTGAGCAGCTCACCAATCAGGACTTCCTGAAAGCTGGCGACAAGGAGA 

ACATTTTTAACGCAGACCATTTCCCTCTCTGGTACCGAAGAGCCGCTGAGCAGATTCCAGG
GAGCTTCGTCTACTCGATCCCATTCAGCACTGGACCAGTCAATAAAAGCAATGTGGTGACA 
GCAAGTACATCCATCCAGCTCCTGGATGAACGGAAATCTCCTGTGGTGGCAGCTGTAGGCA 
TTCAGATGAAACTTGAATTTTTCCAAAGGAAGTTCTGGACTGCCAGCAGACAGTGTGCTTC 
CCTGGATGGCAAATGCTCCATCAGCTGTGATGATGAGACTGTGAATTGTTACCTCATAGAC
AATAATGGATTTATTTTGGTGTCTGAAGACTACACACAGACTGGAGACTTTTTTGGTGAGA 
TCGAGGGAGCTGTGATGAACAAATTGCTAACAATGGGCTCCTTTAAAAGAATTACCCTTTA 
TGACTACCAAGCCATGTGTAGAGCCAACAAGGAAAGCAGCGATGGCGCCCATGGCCTCCTG 
GATCCTTATAATGCCTTCCTCTCTGCAGTAAAATGGATCATGACAGAACTTGTCTTGTTCC
TGGTGGAATTTAACCTCTGCAGTTGGTGGCACTCCGATATGACAGCTAAAGCCCAGAAATT 
GAAACAGACCCTGGAGCCTTGTGATACTGAATATCCAGCATTCGTCTCTGAGCGCACCATC 
AAGGAGACTACAGGGAATATTGCTTGTGAAGACTGCTCCAAGTCCTTTGTCATCCAGCAAA
TCCCAAGCAGCAACCTGTTCATGGTGGTGGTGGACAGCAGCTGCCTCTGTGAATCTGTGGC 
CCCCATCACCATGGCACCCATTGAAATCAGGTATAATGAATCCCTTAAGTGTGAACGTCTA
AAGGCCCAGAAGATCAGAAGGGCCCAGAAGATCAGAAGGCGCCCAGAATCTTGTCATGGCT 
TCCATCCTGAGGAGAATGCAAGGGAGTGTGGGGGTGCGCCG 

10 - human amino acid sequence 

MAGPGSPRRASRGASALLAAALLYAALGDVVRSEQQIPLSVVKLWASAFGGEIKSIAAKYS
GSQLLQKKYKEYEKDVAIEEIDGLQLVKKLAKNMEEMFHKKSEAVRRLVEAAEEAHLKHEF
DADLQYEYFNAVLINERDKDGNFLELGKEFILAPNDHFNNLPVNISLSDVQVPTNMYNKDP
AIVNGVYWSESLNKVFVDNFDRDPSLIWQYFGSAKGFFRQYPGIKWEPDENGVIAFDCRNR
KWYIQAATSPKDVVILVDVSGSMKGLRLTIAKQTVSSILDTLGDDDFFNIIAYNEELHYVE
PCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNILSDFNHTGQGSICSQAIM
LITDGAVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLKWMACANKGFFTQISTLADV
QENVMEYLHVLSRPKVIDQEHDVVWTEAYIDSTLTDDQGPVLMTTVAMPVFSKQNETRSKG
ILLGVVGTDVPVKELLKTIPKYKLGIHGYAFAITNNGYILTHPELRLLYEEGKKRRKPNYS
SVDLSEVEWEDRDDVLRNAMVNRKTGKFSMEVKKTVDKGKRVLVMTNDYYYTDIKGTPFSL
GVALSRGHGKYFFRGNVTIEEGLHDLEHPDVSLADEWSYCNTDLHPEHRHLSQLEAIKLYL
KGKEPLLQCDKELIQEVLFDAVVSAPIEAYWTSLALNKSENSDKGVEVAFLGTRTGLSRIN
LFVGAEQLTNQDFLKAGDKENIFNADHFPLWYRRAAEQIPGSFVYSIPFSTGPVNKSNVVT
ASTSIQLLDERKSPVVAAVGIQMKLEFFQRKFWTASRQCASLDGKCSISCDDETVNCYLID
NNGFILVSEDYTQTGDFFGEIEGAVMNKLLTMGSFKRITLYDYQAMCRANKESSDGAHGLL
DPYNAFLSAVKWIMTELVLFLVEFNLCSWWHSDMTAKAQKLKQTLEPCDTEYPAFVSERTI
KETTGNIACEDCSKSFVIQQIPSSNLFMVVVDSSCLCESVAPI

11 - human amino acid sequence 

MAGPGSPRRASRGASALLAAALLYAALGDVVRSEQQIPLSVVKLWASAFGGEIKSIAAKYS
GSQLLQKKYKEYEKDVAIEEIDGLQLVKKLAKNMEEMFHKKSEAVRRLVEAAEEAHLKHEF
DADLQYEYFNAVLINERDKDGNFLELGKEFILAPNDHFNNLPVNISLSDVQVPTNMYNKDP
AIVNGVYWSESLNKVFVDNFDRDPSLIWQYFGSAKGFFRQYPGIKWEPDENGVIAFDCRNR
KWYIQAATSPKDVVILVDVSGSMKGLRLTIAKQTVSSILDTLGDDDFFNIIAYNEELHYVE
PCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNILSDFNHTGQGSICSQAIM
LITDGAVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLKWMACANKGFFTQISTLADV
QENVMEYLHVLSRPKVIDQEHDVVWTEAYIDSTLTDDQGPVLMTTVAMPVFSKQNETRSKG
ILLGVVGTDVPVKELLKTIPKYKLGIHGYAFAITNNGYILTHPELRLLYEEGKKRRKPNYS
SVDLSEVEWEDRDDVLRNAMVNRKTGKFSMEVKKTVDKGKRVLVMTNDYYYTDIKGTPFSL
GVALSRGHGKYFFRGNVTIEEGLHDLEHPDVSLADEWSYCNTDLHPEHRHLSQLEAIKLYL
KGKEPLLQCDKELIQEVLFDAVVSAPIEAYWTSLALNKSENSDKGVEVAFLGTRTGLSRIN
LFVGAEQLTNQDFLKAGDKENIFNADHFPLWYRRAAEQIPGSFVYSIPFSTGPVNKSNVVT
ASTSIQLLDERKSPVVAAVGIQMKLEFFQRKFWTASRQCASLDGKCSISCDDETVNCYLID
NNGFILVSEDYTQTGDFFGEIEGAVMNKLLTMGSFKRITLYDYQAMCRANKESSDGAHGLL
DPYNAFLSAVKWIMTELVLFLVEFNLCSWWHSDMTAKAQKLKQTLEPCDTEYPAFVSERTI
KETTGNIACEDCSKSFVIQQIPSSNLFMVVVDSSCLCESVAPITMAPIEIRYNESLKCERL
K

12 - human amino acid sequence

MAGPGSPRRASRGASALLAAALLYAALGDVVRSEQQIPLSVVKLWASAFGGEIKSIAAKYS
GSQLLQKKYKEYEKDVAIEEIDGLQLVKKLAKNMEEMFHKKSEAVRRLVEAAEEAHLKHEF
DADLQYEYFNAVLINERDKDGNFLELGKEFILAPNDHFNNLPVNISLSDVQVPTNMYNKDP
AIVNGVYWSESLNKVFVDNFDRDPSLIWQYFGSAKGFFRQYPGIKWEPDENGVIAFDCRNR
KWYIQAATSPKDVVILVDVSGSMKGLRLTIAKQTVSSILDTLGDDDFFNIIAYNEELHYVE
PCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNILSDFNHTGQGSICSQAIM
LITDGAVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLKWMACANKGFFTQISTLADV
QENVMEYLHVLSRPKVIDQEHDVVWTEAYIDSTLTDDQGPVLMTTVAMPVFSKQNETRSKG
ILLGVVGTDVPVKELLKTIPKYKLGIHGYAFAITNNGYILTHPELRLLYEEGKKRRKPNYS
SVDLSEVEWEDRDDVLRNAMVNRKTGKFSMEVKKTVDKGKRVLVMTNDYYYTDIKGTPFSL
GVALSRGHGKYFFRGNVTIEEGLHDLEHPDVSLADEWSYCNTDLHPEHRHLSQLEAIKLYL
KGKEPLLQCDKELIQEVLFDAVVSAPIEAYWTSLALNKSENSDKGVEVAFLGTRTGLSRIN
LFVGAEQLTNQDFLKAGDKENIFNADHFPLWYRRAAEQIPGSFVYSIPFSTGPVNKSNVVT
ASTSIQLLDERKSPVVAAVGIQMKLEFFQRKFWTASRQCASLDGKCSISCDDETVNCYLID
NNGFILVSEDYTQTGDFFGEIEGAVMNKLLTMGSFKRITLYDYQAMCRANKESSDGAHGLL
DPYNAFLSAVKWIMTELVLFLVEFNLCSWWHSDMTAKAQKLKQTLEPCDTEYPAFVSERTI
KETTGNIACEDCSKSFVIQQIPSSNLFMVVVDSSCLCESVAPITMAPIEIRYNESLKCERL
KAQKIRRRPESCHGFHPEENARECGGAP

13 - human nucleotide sequence

AGTGGCCTCCTGAGAAGCAGCTTGTTCGTGGGCTCCGAGAAGGTCTCCGACAGGAAGTTCC 
TGACACCTGAGGACGAGGCCAGCGTGTTCACCCTGGACCGCTTCCCGCTGTGGTACCGCCA 
GGCCTCAGAGCATCCTGCTGGCAGCTTCGTCTTCAACCTCCGCTGGGCAGAAGGACCAGAA 
AGTGCGGGTGAACCCATGGTGGTGACGGCAAGCACAGCTGTGGCGGTGACCGTGGACAAGA
GGACAGCCATTGCTGCAGCCGCGGGCGTCCAAATGAAGCTGGAATTCCTCCAGCGCAAATT 
CTGGGCGGCAACGCGGCAGTGCAGCACTGTGGATGGGCCGTGCACACAGAGCTGCGAGGAC 
AGTGATCTGGACTGCTTCGTCATCGACAACAACGGGTTCATTCTGATCTCCAAGAGGTCCC 
GAGAGACGGGAAGATTTCTGGGGGAGGTGGATGGTGCTGTCCTGACCCAGCTGCTCAGCAT
GGGGGTGTTCAGCCAAGTGACTATGTATGACTATCAGGCCATGTGCAAACCCTCGAGTCAC 
CACCACAGTGCAGCCCAGCCCCTGGTCAGCCCAATTTCTGCCTTCTTGACGGCGACCAGGT 
GGCTGCTGCAGGAGCTGGTGCTGTTCCTGCTGGAGTGGAGTGTCTGGGGCTCCTGGTAGGA 
CAGAGGGGCCGAGGCCAAAAGTGTCTTCCATCACTCCCACAAACACAAGAAGCAGGACCCG 
CTGCAGCCCTGCGACACGGAGTACCCCGTGTTCGTGTACCAGCCGGCCATCCGGGAGGCCA
ACGGGATCGTGGAGTGCGGGCCCTGCCAGAAGGTATTTGTGGTGCAGCAGATTCCCAACAG 
TAACCTCCTCCTCCTGGTGACAGACCCCACCTGTGACTGCAGCATCTTCCCACCAGTG 

14 - human nucleotide sequence 

AGTGGCCTCCTGAGAAGCAGCTTGTTCGTGGGCTCCGAGAAGGTCTCCGACAGGAAGTTCC
TGACACCTGAGGACGAGGCCAGCGTGTTCACCCTGGACCGCTTCCCGCTGTGGTACCGCCA 
GGCCTCAGAGCATCCTGCTGGCAGCTTCGTCTTCAACCTCCGCTGGGCAGAAGGACCAGAA 
AGTGCGGGTGAACCCATGGTGGTGACGGCAAGCACAGCTGTGGCGGTGACCGTGGACAAGA 
GGACAGCCATTGCTGCAGCCGCGGGCGTCCAAATGAAGCTGGAATTCCTCCAGCGCAAATT 

CTGGGCGGCAACGCGGCAGTGCAGCACTGTGGATGGGCCGTGCACACAGAGCTGCGAGGAC
AGTGATCTGGACTGCTTCGTCATCGACAACAACGGGTTCATTCTGATCTCCAAGAGGTCCC 
GAGAGACGGGAAGATTTCTGGGGGAGGTGGATGGTGCTGTCCTGACCCAGCTGCTCAGCAT 
GGGGGTGTTCAGCCAAGTGACTATGTATGACTATCAGGCCATGTGCAAACCCTCGAGTCAC 
CACCACAGTGCAGCCCAGCCCCTGGTCAGCCCAATTTCTGCCTTCTTGACGGCGACCAGGT 

GGCTGCTGCAGGAGCTGGTGCTGTTCCTGCTGGAGTGGAGTGTCTGGGGCTCCTGGTACGA
CAGAGGGGCCGAGGCCAAAAGTGTCTTCCATCACTCCCACAAACACAAGAAGCAGGACCCG 
CTGCAGCCCTGCGACACGGAGTACCCCGTGTTCGTGTACCAGCCGGCCATCCGGGAGGCCA 
ACGGGATCGTGGAGTGCGGGCCCTGCCAGAAGGTATTTGTGGTGCAGCAGATTCCCAACAG 
TAACCTCCTCCTCCTGGTGACAGACCCCACCTGTGACTGCAGCATCTTCCCACCAGTGCTG 

CAGGAGGCGACAGAAGTCAAATATAATGCCTCTGTCAAATGTGACCGGATGCGC

15 - human nucleotide sequence 

AGTGGCCTCCTGAGAAGCAGCTTGTTCGTGGGCTCCGAGAAGGTCTCCGACAGGAAGTTCC 
TGACACCTGAGGACGAGGCCAGCGTGTTCACCCTGGACCGCTTCCCGCTGTGGTACCGCCA 
GGCCTCAGAGCATCCTGCTGGCAGCTTCGTCTTCAACCTCCGCTGGGCAGAAGGACCAGAA
AGTGCGGGTGAACCCATGGTGGTGACGGCAAGCACAGCTGTGGCGGTGACCGTGGACAAGA 
GGACAGCCATTGCTGCAGCCGCGGGCGTCCAAATGAAGCTGGAATTCCTCCAGCGCAAATT 
CTGGGCGGCAACGCGGCAGTGCAGCACTGTGGATGGGCCGTGCACACAGAGCTGCGAGGAC 
AGTGATCTGGACTGCTTCGTCATCGACAACAACGGGTTCATTCTGATCTCCAAGAGGTCCC
GAGAGACGGGAAGATTTCTGGGGGAGGTGGATGGTGCTGTCCTGACCCAGCTGCTCAGCAT 
GGGGGTGTTCAGCCAAGTGACTATGTATGACTATCAGGCCATGTGCAAACCCTCGAGTCAC 
CACCACAGTGCAGCCCAGCCCCTGGTCAGCCCAATTTCTGCCTTCTTGACGGCGACCAGGT 
GGCTGCTGCAGGAGCTGGTGCTGTTCCTGCTGGAGTGGAGTGTCTGGGGCTCCTGGTACGA
CAGAGGGGCCGAGGCCAAAAGTGTCTTCCATCACTCCCACAAACACAAGAAGCAGGACCCG 
CTGCAGCCCTGCGACACGGAGTACCCCGTGTTCGTGTACCAGCCGGCCATCCGGGAGGCCA 
ACGGGATCGTGGAGTGCGGGCCCTGCCAGAAGGTATTTGTGGTGCAGCAGATTCCCAACAG 
TAACCTCCTCCTCCTGGTGACAGACCCCACCTGTGACTGCAGCATCTTCCCACCAGTGCTG 
CAGGAGGCGACAGAAGTCAAATATAATGCCTCTGTCAAATGTGACCGGATGCGCTCCCAGA
AGCTCCGCCGGCGACCAGACTCCTGCCACGCCTTCCATCCAGAGGAGAATGCCCAGGACTG 
CGGCGGCGCCTCG 

16 - human amino acid sequence 

SGLLRSSLFVGSEKVSDRKFLTPEDEASVFTLDRFPLWYRQASEHPAGSFVFNLRWAEGPE
SAGEPMVVTASTAVAVTVDKRTAIAAAAGVQMKLEFLQRKFWAATRQCSTVDGPCTQSCED
SDLDCFVIDNNGFILISKRSRETGRFLGEVDGAVLTQLLSMGVFSQVTMYDYQAMCKPSSH
HHSAAQPLVSPISAFLTATRWLLQELVLFLLEWSVWGSWYDRGAEAKSVFHHSHKHKKQDP
LQPCDTEYPVFVYQPAIREANGIVECGPCQKVFVVQQIPNSNLLLLVTDPTCDCSIFPPV

17 - human amino acid sequence 

SGLLRSSLFVGSEKVSDRKFLTPEDEASVFTLDRFPLWYRQASEHPAGSFVFNLRWAEGPE
SAGEPMVVTASTAVAVTVDKRTAIAAAAGVQMKLEFLQRKFWAATRQCSTVDGPCTQSCED
SDLDCFVIDNNGFILISKRSRETGRFLGEVDGAVLTQLLSMGVFSQVTMYDYQAMCKPSSH
HHSAAQPLVSPISAFLTATRWLLQELVLFLLEWSVWGSWYDRGAEAKSVFHHSHKHKKQDP
LQPCDTEYPVFVYQPAIREANGIVECGPCQKVFVVQQIPNSNLLLLVTDPTCDCSIFPPVL
QEATEVKYNASVKCDRMR

18 - human amino acid sequence 

SGLLRSSLFVGSEKVSDRKFLTPEDEASVFTLDRFPLWYRQASEHPAGSFVFNLRWAEGPE
SAGEPMVVTASTAVAVTVDKRTAIAAAAGVQMKLEFLQRKFWAATRQCSTVDGPCTQSCED
SDLDCFVIDNNGFILISKRSRETGRFLGEVDGAVLTQLLSMGVFSQVTMYDYQAMCKPSSH
HHSAAQPLVSPISAFLTATRWLLQELVLFLLEWSVWGSWYDRGAEAKSVFHHSHKHKKQDP
LQPCDTEYPVFVYQPAIREANGIVECGPCQKVFVVQQIPNSNLLLLVTDPTCDCSIFPPVL
QEATEVKYNASVKCDRMRSQKLRRRPDSCHAFHPEENAQDCGGAS




19 - human alpha2 delta-2 nucleotide sequence

CGGGCAGCGCAGCCCGCAGAGGCGCTGCGGCCCGTGCAGCCCCGGAGGCCCCTCGCGGAGA 
AGGCGGCGGCGGAGGAGAGGCCGAGTTACCGCCCGCCGCCCGCGCCCCCCCAACCCCGCCG 
CCGCCGCCGCCGCCGCCACTGCCCCCCCTCCCCGCGGCGCCGCATCTTGAATGGAAACATG 
GCGGTGCCGGCTCGGACCTGCGGCGCCTCTCGGCCCGGCCCAGCGCGGACTGCGCGCCCCT
GGCCCGGCTGCGGCCCCCACCCTGGCCCCGGCACCCGGCGCCCGACGTCCGGGCCCCCGCG 
CCCGCTGTGGCTGCTGCTGCCGCTTCTACCGCTGCTCGCCGCCCCCGGCGCCTCTGCCTAC 
AGCTTCCCCCAGCAGCACACGATGCAGCACTGGGCCCGGCGTCTGGAGCAGGAGGTCGACG 
GCGTGATGCGGATTTTTGGAGGCGTCCAGCAGCTCCGTGAGATTTACAAGGACAACCGGAA 

CCTGTTCGAGGTACAGGAGAATGAGCCTCAGAAGTTGGTGGAGAAGGTGGCAGGGGACATT
GAGAGCCTTCTGGACAGGAAGGTGCAGGCCCTGAAGAGACTGGCTGATGCTGCAGAGAACT 
TCCAGAAAGCACACCGCTGGCAGGACAACATCAAGGAGGAAGACATCGTGTACTATGACGC 
CAAGGCTGACGCTGAGCTGGACGACCCTGAGAGTGAGGATGTGGAAAGGGGGTCTAAGGCC 
AGCACCCTAAGGCTGGACTTCATCGAGGACCCAAACTTCAAGAACAAGGTCAACTATTCAT 

ACGCGGCTGTACAGATCCCTACGGACATCTACAAAGGCTCCACTGTCATCCTCAATGAGCT
CAACTGGACAGAGGCCCTGGAGAATGTGTTCATGGAAAACCGCAGACAAGACCCCACACTG 
CTGTGGCAGGTCTTCGGCAGCGCCACAGGAGTCACTCGCTACTACCCGGCCACCCCGTGGC 
GAGCCCCCAAGAAGATCGACCTGTACGATGTCCGAAGGAGACCCTGGTATATCCAGGGGGC 
CTCGTCACCCAAAGACATGGTCATCATCGTGGATGTGAGTGGCAGTGTGAGCGGCCTGACC 

CTGAAGCTGATGAAGACATCTGTCTGCGAGATGCTGGACACGCTGTCTGATGATGACTATG
TGAATGTGGCCTCGTTCAACGAGAAGGCACAGCCTGTGTCATGCTTCACACACCTGGTGCA 
GGCCAATGTGCGCAACAAGAAGGTGTTCAAGGAAGCTGTGCAGGGCATGGTGGCCAAGGGC 
ACCACAGGCTACAAGGCCGGCTTTGAGTATGCCTTTGACCAGCTGCAGAACTCCAACATCA 
CTCGGGCCAACTGCAACAAGATGATCATGATGTTCACGGATGGTGGTGAGGACCGCGTGCA 

GGACGTCTTTGAGAAGTACAATTGGCCAAACCGGACGGTGCGCGTGTTTACTTTCTCCGTG
GGGCAGCATAACTATGACGTCACACCGCTGCAGTGGATGGCCTGTGCCAACAAAGGCTACT 
ATTTTGAGATCCCTTCCATCGGAGCCATCCGCATCAACACACAGGAATATCTAGATGTGTT 
GGGCAGGCCCATGGTGCTGGCAGGCAAGGAGGCCAAGCAGGTTCAGTGGACCAACGTGTAT 
GAGGATGCACTGGGACTGGGGTTGGTGGTAACAGGGACCCTCCCTGTTTTCAACCTGACAC 

AGGATGGCCCTGGGGAAAAGAAGAACCAGCTGATCCTGGGCGTGATGGGCATTGACGTGGC
TCTGAATGACATCAAGAGGCTGACCCCCAACTACACGCTTGGAGCCAACGGCTATGTGTTT 
GCCATTGACCTGAACGGCTACGTGTTGCTGCACCCCAATCTCAAGCCCCAGACCACCAACT 
TCCGGGAGCCTGTGACTCTGGACTTCCTGGATGCGGAGCTAGAGGATGAGAACAAGGAAGA 
GATCCGTCGGAGCATGATTGATGGCAACAAGGGCCACAAGCAGATCAGAACGTTGGTCAAG 

TCCCTGGATGAGAGGTACATAGATGAGGTGACACGGAACTACACCTGGGTGCCTATAAGGA
GCACTAACTACAGCCTGGGGCTGGTGCTCCCACCCTACAGCACCTTCTACCTCCAAGCGAA 
TCTCAGTGACCAGATCCTGCAGGTCAAGTATTTTGAGTTCCTGCTCCCCAGCAGCTTTGAG 
TCTGAAGGACACGTTTTCATTGCTCCCAGAGAGTACTGCAAGGACCTGAATGCCTCAGACA 




ACAACACCGAGTTCCTGAAAAACTTTATTGAGCTCATGGAGAAAGTGACTCCAGACTCCAA 
GCAGTGCAACAACTTCCTTCTGCACAACCTGATCTTGGACACGGGCATCACGCAGCAGCTG 
GTAGAGCGTGTGTGGAGGGACCAGGATCTCAACACGTACAGCCTACTGGCCGTGTTCGCTG 
CCACAGACGGTGGCATCACCCGAGTCTTCCCCAACAAGGCAGCTGAGGACTGGACAGAGAA 
CCCTGAGCCCTTCAATGCCAGCTTCTACCGCCGCAGCCTGGATAACCACGGTTATGTCTTC
AAGCCCCCACACCAGGATGCCCTGTTAAGGCCGCTGGAGCTGGAGAATGACACTGTGGGCA 
TCCTCGTCAGCACAGCTGTGGAGCTCAGCCTAGGCAGGCGCACACTGAGGCCAGCAGTGGT 
GGGCGTCAAGCTGGACCTAGAGGCTTGGGCTGAGAAGTTCAAGGTGCTAGCCAGCAACCGT 
ACCCACCAAGACCAGCCTCAGAAGTGCGGCCCCAACAGCCACTGTGAGATGGACTGCGAGG 

TTAACAATGAGGACTTACTCTGTGTCCTCATTGATGATGGAGGATTCCTGGTGCTGTCAAA
CCAGAACCATCAGTGGGACCAGGTGGGCAGGTTCTTCAGTGAGGTGGATGCCAACCTGATG 
CTGGCACTCTACAATAACTCCTTCTACACCCGCAAGGAGTCCTATGACTATCAGGCAGCCT 
GTGCCCCTCAGCCCCCTGGCAACCTGGGTGCTGCACCCCGGGGTGTCTTTGTGCCCACCGT 
TGCAGATTTCCTTAACCTGGCCTGGTGGACCTCTGCTGCCGCCTGGTCCCTGTTCCAGCAG 

CTTCTCTACGGCCTCATCTACCACAGCTGGTTCCAAGCAGACCCCGCGGAGGCCGAGGGGA
GCCCCGAGACGCGCGAGAGCAGCTGCGTCATGAAACAGACCCAGTACTACTTCGGCTCGGT 
AAACGCCTCCTACAACGCCATCATCGACTGCGGAAACTGCTCCAGGCTGTTCCACGCGCAG 
AGACTGACCAACACCAATCTTCTCTTTGTGGTGGCCGAGAAGCCGCTGTGCAGCCAGTGCG 
AGGCTGGCCGGCTGCTGCAGAAGGAGACGCACTGCCCAGCGGACGGCCCGGAGCAGTGTGA 

GCTAGTGCAGAGACCGCGATACCGGAGAGGCCCGCACATCTGCTTCGACTACAACGCGACA
GAAGATACCTCAGACTGTGGCCGCGGGGCCTCCTTCCCGCCGTCGCTGGGCGTCCTGGTCT 
CCCTGCAACTGCTGCTCCTCCTGGGCCTGCCGCCCCGGCCGCAGCCTCAAGTCCTCGTCCA 
CGCCTCTCGCCGCCTCTGAGCACCCTGCCCCACCCCACCTCCACTCCCACCTCACCCGGCC 
TCTTCGCCTTTCCCACCCTCCTGCCCCACACTCCCCGCCTTAGAGCCTCGTCCCTCCCTCA 

CTGAAGGACCTGAGCTGGCCAGGCCCTGAGAGTCTGGTCTGCGCCTTGGGATGGGGAGTCC
CAAAGCGGGACGCCGCAGGTGTTTGGCACCCAAATCACATCTCACCTCCGAACTGTTCAAG 
TGTCCCCAGACCCTTCTTGCCTGCTGGGCTCCCCCCAGTGGGATGGGACAGGGAGGCCACA 
CGCACTGGTGCCAAAACCAGGCCTCTGCTGCCGCCCTTCCTGGAGGCTGCCTATGTTGGGG 
GGGACCCTGCCTCAGCTGACCCGGCCTCTCTGCCCCACCCAAGCCCAAACTTGGTTTCTGT 

GAGAATAGTGGAGGAAGGTGAGATGGCCAGTTTGAAGCCTGTGCCTCCCAGCTTAAATCCT
AGCAGGAGAGAGGCTCTGGGGCAGCCCCCATGGGCTCCTGCCCCTTTCAGGCCTACAGCCA 
CATCCCCAAGCCCACCAGGTGTCAGGATAGTCACAGTGATACCAGTTCAGACACTACCCCA 
TATACACCTGGAACATTGAGGATGGAAACTGGACTCACATTCGACATACCCCACTGGGCAC 
ACGCACAAACACACACACTATGGGGTGGGGTGGGTGTAGGGGCTTACAAAGCCTTACACAG 

GGCGAGGGGTTGGTGGGAGGGTTGGCACCTGCACACTCCATCTCCTGCTCACCACCTGCCT
CTAATCTGAGCTGCAGCCTGGCTGGTCCTCCCATTTCTAAAGCTGAATGTCAAACAGTGCC 
AAATGCTGGGGCAGGGGGTGAAGAACCCTCTGTCCCACCCCTAGCCACCAGTGTCCTCCAA 
GTGCCCCCTCACCTCTCCAGGTGCTCATTGTAACCATTTCTCACTAGTGTCAGGCCCCCAG 




TGGGACCACATGCCACTGCCTGCACCTTTCGGCAGAGGAACCCCCACCAGACATCACCCTT 
TGCCTTAGCAGGGGTGACTTTGTCTCTCCTGGCTGGGCCATCCTTCCGCCAATCTGGCCCT 
TACACACTCAGGCCTGTGCCCACTCCCTATCTCCTTCCCACCCCTACACACACACTCCCTG 
CTTGCAGGAGGCCAAACTGTCCCTCCCTTGCTGAACACACACACACACACACACACACAGG 
TGGGGACTGGGCACAGCTCTTCACACCATTCATTCTGGTCATTTCCCCCAAAGGCATCCCA
GCCTGGGGGCCAGTGGGGAACTGAGGGCAAGGGGATATAGTGATGGGGCTCAGATGGACTG 
GGAGGAGGGGGAGGGTGATGCATTAATTAATGGCTTCGTTAATTAATGTCATGTTGCTTGT 
CGCTTTCTGAGTGTGTGTGTGTGGTCCATGGCCACTGCTGGTGCCAGGGTGGGTGTCGATG 
TGCACCCGGCCTGGATGCCAGCTGTGTCCTTCGGGGGCGTGCGTGTAACTGTAGTGTAGTC 

AGGTGCTCAATGGAGAATATAAACATATACAGAAAAATATATATTTTAAGTTTAAAAAACA
GAAAAACAGACAAAACAATCCCCATCAGGTAGCTGTCTAACCCCCAGCTGGGTCTAATCCT 
TCTCATTACCCACCCGACCTGGCTGCCCCTCACCTTGGGCTGGGGGACTGGGGGGCCATTT 
CCTTTTCTCTGCCCTTTTTTTGTTGTTCTATTTTGTACAGACAAGTTGGAAAAACAACAGC 
GACAAAAAAGTCAAGAAACTTTGTAAAATATCGTGTGTGTGATTCCTTGTAAAATATTTTC 

AAATGGTTTATTACAGAAGATCAGTTATTAAATAATGTTCATATTTTCACTTC

20 - human alpha2 delta-2 amino acid sequence

MAVPARTCGASRPGPARTARPWPGCGPHPGPGTRRPTSGPPRPLWLLLPLLPLLAAPGASA
YSFPQQHTMQHWARRLEQEVDGVMRIFGGVQQLREIYKDNRNLFEVQENEPQKLVEKVAGD
IESLLDRKVQALKRLADAAENFQKAHRWQDNIKEEDIVYYDAKADAELDDPESEDVERGSK
ASTLRLDFIEDPNFKNKVNYSYAAVQIPTDIYKGSTVILNELNWTEALENVFMENRRQDPT
LLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGL
TLKLMKTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAK
GTTGYKAGFEYAFDQLQNSNITRANCNKMIMMFTDGGEDRVQDVFEKYNWPNRTVRVFTFS
VGQHNYDVTPLQWMACANKGYYFEIPSIGAIRINTQEYLDVLGRPMVLAGKEAKQVQWTNV
YEDALGLGLVVTGTLPVFNLTQDGPGEKKNQLILGVMGIDVALNDIKRLTPNYTLGANGYV
FAIDLNGYVLLHPNLKPQTTNFREPVTLDFLDAELEDENKEEIRRSMIDGNKGHKQIRTLV
KSLDERYIDEVTRNYTWVPIRSTNYSLGLVLPPYSTFYLQANLSDQILQVKYFEFLLPSSF
ESEGHVFIAPREYCKDLNASDNNTEFLKNFIELMEKVTPDSKQCNNFLLHNLILDTGITQQ
LVERVWRDQDLNTYSLLAVFAATDGGITRVFPNKAAEDWTENPEPFNASFYRRSLDNHGYV
FKPPHQDALLRPLELENDTVGILVSTAVELSLGRRTLRPAVVGVKLDLEAWAEKFKVLASN
RTHQDQPQKCGPNSHCEMDCEVNNEDLLCVLIDDGGFLVLSNQNHQWDQVGRFFSEVDANL
MLALYNNSFYTRKESYDYQAACAPQPPGNLGAAPRGVFVPTVADFLNLAWWTSAAAWSLFQ
QLLYGLIYHSWFQADPAEAEGSPETRESSCVMKQTQYYFGSVNASYNAIIDCGNCSRLFHA
QRLTNTNLLFVVAEKPLCSQCEAGRLLQKETHCPADGPEQCELVQRPRYRRGPHICFDYNA
TEDTSDCGRGASFPPSLGVLVSLQLLLLLGLPPRPQPQVLVHASRRL
21 - human alpha2 delta-3 nucleotide sequence

TACTATAGGGCGGCCGCGAATTCGGCACGAGGCGGCGCGGAGCGGAGCAGGCAGCCCCGCG 
CGCTCGCCCACCGCCCGCTCCGCGCAGCTCCCCGCGGCCGCTCTCGTCGCCGCCGCAGCGG 
GCGCGTCGGAGGGAGCCCAGCATGGCCGGGCCGGGCTCGCCGCGCCGCGCGTCCCGGGGGG 
CCTCGGCGCTTCTCGCTGCCGCGCTTCTCTACGCCGCGCTGGGGGACGTGGTGCGCTCGGA
GCAGCAGATACCGCTCTCCGTGGTGAAGCTCTGGGCCTCGGCTTTTGGTGGGGAGATAAAA 
TCCATTGCTGCTAAGTACTCCGGTTCCCAGCTTCTGCAAAAGAAATACAAAGAGTATGAGA 
AAGACGTTGCCATAGAAGAAATTGATGGCCTCCAACTGGTAAAGAAGCTGGCAAAGAACAT 
GGAAGAGATGTTTCACAAGAAGTCTGAGGCCGTCAGGCGTCTGGTGGAGGCTGCAGAAGAA 

GCACACCTGAAACATGAATTTGATGCAGACTTACAGTATGAATACTTCAATGCTGTGCTGA
TAAATGAAAGGGACAAAGACGGGAATTTTTTGGAGCTGGGAAAGGAATTCATCTTAGCCCC 
AAATGACCATTTTAATAATTTGCCTGTGAACATCAGTCTAAGTGACGTCCAAGTACCAACG 
AACATGTACAACAAAGACCCTGCAATTGTCAATGGGGTTTATTGGTCTGAATCTCTAAACA 
AAGTTTTTGTAGATAACTTTGACCGTGACCCATCTCTCATATGGCAGTACTTTGGAAGTGC 

AAAGGGCTTTTTTAGGCAGTATCCGGGGATTAAATGGGAACCAGATGAGAATGGAGTCATT
GCCTTCGACTGCAGGAACCGAAAATGGTACATCCAGGCAGCAACTTCTCCGAAAGACGTGG 
TCATTTTAGTTGACGTCAGTGGCAGCATGAAAGGACTCCGTCTGACTATCGCGAAGCAAAC 
AGTCTCATCCATTTTGGATACACTTGGGGATGATGACTTCTTCAACATAATTGCTTATAAT 
GAGGAGCTTCACTATGTGGAACCTTGCCTGAATGGAACTTTGGTGCAAGCCGACAGGACAA 

ACAAAGAGCACTTCAGGGAGCATCTGGACAAACTTTTCGCCAAAGGAATTGGAATGTTGGA
TATAGCTCTGAATGAGGCCTTCAACATTCTGAGTGATTTCAACCACACGGGACAAGGAAGT 
ATCTGCAGTCAGGCCATCATGCTCATAACTGATGGGGCGGTGGACACCTATGATACAATCT 
TTGCAAAATACAATTGGCCAGATCGAAAGGTTCGCATCTTCACATACCTCATTGGACGAGA 
GGCTGCGTTTGCAGACAATCTAAAGTGGATGGCCTGTGCCAACAAAGGATTTTTTACCCAG 

ATCTCCACCTTGGCTGATGTGCAGGAGAATGTCATGGAATACCTTCACGTGCTTAGCCGGC
CCAAAGTCATCGACCAGGAGCATGATGTGGTGTGGACCGAAGCTTACATTGACAGCACTCT 
GACTGATGATCAGGGCCCCGTCCTGATGACCACTGTAGCCATGCCTGTGTTTAGTAAGCAG 
AACGAAACCAGATCGAAGGGCATTCTTCTGGGAGTGGTTGGCACAGATGTCCCAGTGAAAG 
AACTTCTGAAGACCATCCCCAAATACAAGTTAGGGATTCACGGTTATGCCTTTGCAATCAC 

AAATAATGGRTATATCCTGACGCATCCGGAACTCAGGCTGCTGTACGAAGAAGGAAAAAAG
CGAAGGAAACCTAACTATAGTAGCGTTGACCTCTCTGAGGTGGAGTGGGAAGACCGAGATG 
ACGTGTTGAGAAATGCTATGGTGAATCGAAAGACGGGGAAGTTTTCCATGGAGGTGAAGAA 
GACAGTGGACAAAGGGAAACGGGTTTTGGTGATGACAAATGACTACTATTATACAGACATC 
AAGGGTACTCCTTTCAGTTTAGGTGTGGCGCTTTCCAGAGGTCATGGGAAATATTTCTTCC 

GAGGGAATGTAACCATCGAAGAAGGCCTGCATGACTTAGAACATCCCGATGTGTCCTTGGC
AGATGAATGGTCCTACTGCAACACTGACCTACACCCTGAGCACCGCCATCTGTCTCAGTTA 
GAAGCGATTAAGCTCTACCTAAAAGGCAAAGAACCTCTGCTCCAGTGTGATAAAGAATTGA 
TCCAAGAAGTCCTTTTTGACGCGGTGGTGAGTGCCCCCATTGAAGCGTATTGGACCAGCCT 
GGCCCTCAACAAATCTGAAAATTCTGACAAGGGCGTGGAGGTTGCCTTCCTCGGCACTCGC
ACGGGCCTCTCCAGAATCAACCTGTTTGTCGGGGCTGAGCAGCTCACCAATCAGGACTTCC 
TGAAAGCTGGCGACAAGGAGAACATTTTTAACGCAGACCATTTCCCTCTCTGGTACCGAAG 
AGCCGCTGAGCAGATTCCAGGGAGCTTCGTCTACTCGATCCCATTCAGCACTGGACCAGTC 
AATAAAAGCAATGTGGTGACAGCAAGTACATCCATCCAGCTCCTGGATGAACGGAAATCTC
CTGTGGTGGCAGCTGTAGGCATTCAGATGAAACTTGAATTTTTCCAAAGGAAGTTCTGGAC 
TGCCAGCAGACAGTGTGCTTCCCTGGATGGCAAATGCTCCATCAGCTGTGATGATGAGACT 
GTGAATTGTTACCTCATAGACAATAATGGATTTATTTTGGTGTCTGAAGACTACACACAGA 
CTGGAGACTTTTTTGGTGAGATCGAGGGAGCTGTGATGAACAAATTGCTAACAATGGGCTC 

CTTTAAAAGAATTACCCTTTATGACTACCAAGCCATGTGTAGAGCCAACAAGGAAAGCAGC
GATGGCGCCCATGGCCTCCTGGATCCTTATAATGCCTTCCTCTCTGCAGTAAAATGGATCA 
TGACAGAACTTGTCTTGTTCCTGGTGGAATTTAACCTCTGCAGTTGGTGGCACTCCGATAT 
GACAGCTAAAGCCCAGAAATTGAAACAGACCCTGGAGCCTTGTGATACTGAATATCCAGCA 
TTCGTCTCTGAGCGCACCATCAAGGAGACTACAGGGAATATTGCTTGTGAAGACTGCTCCA 

AGTCCTTTGTCATCCAGCAAATCCCAAGCAGCAACCTGTTCATGGTGGTGGTGGACAGCAG
CTGCCTCTGTGAATCTGTGGCCCCCATCACCATGGCACCCATTGAAATCAGGTATAATGAA 
TCCCTTAAGTGTGAACGTCTAAAGGCCCAGAAGATCAGAAGGCGCCCAGAATCTTGTCATG 
GCTTCCATCCTGAGGAGAATGCAAGGGAGTGTGGGGGTGCGCCGAGTCTCCAAGCCCAGAC 
AGTCCTCCTTCTGCTCCCTCTGCTTTTGATGCTCTTCTCAAGGTGACACTGACTGAGATGT 

TCTCTTACTGACTGAGATGTTCTCTTGGCATGCTAAATCATGGATAAACTGTGAACCAAAA
TATGGTGCAACATACGAGACATGAATATAGTCCAACCATCAGCATCTCATCATGATTTTAA 
ACTGTGCGTGATATAAACTCTTAAAGATATGTTGACAAAAAGTTATCTATCATCTTTTTAC 
TTTGCCAGTCATGCAAATGTGAGTTTGCCACATGATAATCACCCTTCATCAGAAATGGGAC 
CGCAAGTGGTAGGCAGTGTCCCTTCTGCTTGAAACCTATTGAAACCAATTTAAAACTGTGT 

ACTTTTTAAATAAAGTATATTAAAATCATAAAAAAAAAAAAAAAAAAAA



22 - human alpha2 delta-3 amino acid sequence 

MAGPGSPRRASRGASALLAAALLYAALGDVVRSEQQIPLSVVKLWASAFGGEIKSIAAKYS
GSQLLQKKYKEYEKDVAIEEIDGLQLVKKLAKNMEEMFHKKSEAVRRLVEAAEEAHLKHEF
DADLQYEYFNAVLINERDKDGNFLELGKEFILAPNDHFNNLPVNISLSDVQVPTNMYNKDP
AIVNGVYWSESLNKVFVDNFDRDPSLIWQYFGSAKGFFRQYPGIKWEPDENGVIAFDCRNR
KWYIQAATSPKDVVILVDVSGSMKGLRLTIAKQTVSSILDTLGDDDFFNIIAYNEELHYVE
PCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNILSDFNHTGQGSICSQAIM
LITDGAVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLKWMACANKGFFTQISTLADV
QENVMEYLHVLSRPKVIDQEHDVVWTEAYIDSTLTDDQGPVLMTTVAMPVFSKQNETRSKG
ILLGVVGTDVPVKELLKTIPKYKLGIHGYAFAITNNGYILTHPELRLLYEEGKKRRKPNYS
SVDLSEVEWEDRDDVLRNAMVNRKTGKFSMEVKKTVDKGKRVLVMTNDYYYTDIKGTPFSL
GVALSRGHGKYFFRGNVTIEEGLHDLEHPDVSLADEWSYCNTDLHPEHRHLSQLEAIKLYL
KGKEPLLQCDKELIQEVLFDAVVSAPIEAYWTSLALNKSENSDKGVEVAFLGTRTGLSRIN
LFVGAEQLTNQDFLKAGDKENIFNADHFPLWYRRAAEQIPGSFVYSIPFSTGPVNKSNVVT
ASTSIQLLDERKSPVVAAVGIQMKLEFFQRKFWTASRQCASLDGKCSISCDDETVNCYLID
NNGFILVSEDYTQTGDFFGEIEGAVMNKLLTMGSFKRITLYDYQAMCRANKESSDGAHGLL
DPYNAFLSAVKWIMTELVLFLVEFNLCSWWHSDMTAKAQKLKQTLEPCDTEYPAFVSERTI
KETTGNIACEDCSKSFVIQQIPSSNLFMVVVDSSCLCESVAPITMAPIEIRYNESLKCERL
KAQKIRRRPESCHGFHPEENARECGGAPSLQAQTVLLLLPLLLMLFSR

23 - human amino acid sequence 

MAVPARTCGASRPGPARTARPWPGCGPHPGPGTRRPTSGPPRPLWLLLPLLPLLAAPGASA
YSFPQQHTMQHWARRLEQEVDGVMRIFGGVQQLREIYKDNRNLFEVQENEPQKLVEKVAGD
IESLLDRKVQALKRLADAAENFQKAHRWQDNIKEEDIVYYDAKADAELDDPESEDVERGSK
ASTLRLDFIEDPNFKNKVNYSYAAVQIPTDIYKGSTVILNELNWTEALENVFMENRRQDPT
LLWQVFGSATGVTRYYPATPWRAPKKIDLYDVRRRPWYIQGASSPKDMVIIVDVSGSVSGL
TLKLMKTSVCEMLDTLSDDDYVNVASFNEKAQPVSCFTHLVQANVRNKKVFKEAVQGMVAK
GTTGYKAGFEYAFDQLQNSNITRANCNKMIMMFTDGGEDRVQDVFEKYNWPNRTVRVFTFS
VGQHNYDVTPLQWMACANKGYYFEIPSIGAIRINTQEYLDVLGRPMVLAGKEAKQVQWTNV
YEDALGLGLVVTGTLPVFNLTQDGPGEKKNQLILGVMGIDVALNDIKRLTPNYTLGANGYV
FAIDLNGYVLLHPNLKPQTTNFREPVTLDFLDAELEDENKEEIRRSMIDGNKGHKQIRTLV
KSLDERYIDEVTRNYTWVPIRSTNYSLGLVLPPYSTFYLQANLSDQILQVKYFEFLLPSSF
ESEGHVFIAPREYCKDLNASDNNTEFLKNFIELMEKVTPDSKQCNNFLLHNLILDTGITQQ
LVERVWRDQDLNTYSLLAVFAATDGGITRVFPNKAAEDWTENPEPFNASFYRRSLDNHGYV
FKPPHQDALLRPLELENDTVGILVSTAVELSLGRRTLRPAVVGVKLDLEAWAEKFKVLASN
RTHQDQPQKCGPNSHCEMDCEVNNEDLLCVLIDDGGFLVLSNQNHQWDQVGRFFSEVDANL
MLALYNNSFYTRKESYDYQAACAPQPPGNLGAAPRGVFVPTVADFLNLAWWTSAAAWSLFQ
QLLYGLIYHSWFQADPAEAEGSPETRESSCVMKQTQYYFGSVNASYNAIIDCGNCSRLFHA
QRLTNTNLLFVVAEKPLCSQCEAGRLLQKETHCPADGPEQCELVQRPRYRRGPHICFDYNA
TEDTSDCGRGAHHHHHH

24 - mouse amino acid sequence

MAGPGSLCCASRGASALLATALLYAALGDWRSEQQIPLSWKLWASAFGGEIKSIAAKYS
GSQLLQKKYKEYEKDVAIEEIDGLQLVKKLAKIMEEMFHKKSEAVRRLVEAAEEAHLKHEF 
DADLQYEYFNAVLINERDKDGNFLELGKEFILAPNDHFNNLPVNISLSDVQVPTNMYNKDP 
AIVNGVYWSESLNKVFVDNFDRDPSLIWQYFGSAKGFFRQYPGIKWEPDENGVIAFDCRNR 
KWYIQAATSPKDWILVDVSGSMKGLRLTIAKQTVSSILDTLGDDDFFNIITYNEELHYVE
PCLNGTLVQADRTNKEHFREHLDKLFAKGIGMLDIALNEAFNILSDFNHTGQGSICSQAIM 
LITDGAVDTYDTIFAKYNWPDRKVRIFTYLIGREAAFADNLKWMACANKGFFTQISTLADV 
QENVMEYLHVLSRPKVIDQEHDVVWTEAYIDSTLPQAQKLADDQGLVLMTTVAMPVFSKQN 
ETRSKGILLGWGTDVPVKELLKTIPKYKLGIHGYAFAITNNGYILTHPELRPLYEEGKKR
RKPNYSSVDLSEVEWEDRDDVLRNAMVNRKTGKFSMEVKKTVDKGKRVLVMTNDYYYTDIK 
GTPFSLGVALSRGHGKYFFRGNVTIEEGLHDLEHPDVSLADEWSYCNTDLHPEHRHLSQLE 
AIKLYLKGKEPLLQCDKELIQEVLFDAWSAPIEAYWTSLALNKSENSDKGVEVAFLGTRT
GLSRINLFVGAEQLTNQDFLKAGDKENIFNADHFPLWYRRAAEQIAGSFVYSIPFSTGTVN
KSNWTASTSIQLLDERKSPWAAVGIQMKLEFFQRKFWTASRQCASLDGKCSISCDDETV 
NCYLIDNNGFILVSEDYTQTGDFFGEVEGAVMNKLLTMGSFKRITLYDYQAMCRANKESSD 
SAHGLLDPYKAFLSAAKWIMTELVLFLVEFNLCSWWHSDMTAKAQKLKQTLEPCDTEYPAF 
VSERTIKETTGNIACEDCSKSFVIQQIPSSNLFMVWDSSCLCESVAPITMAPIEIRYNES 
LKCERLKAQKIRRRPESCHGFHPEENARECGGASHHHHHH

25 - nucleotide sequence 

TCGCCACCATGGCGGTGCCGGCTC 

26 - nucleotide sequence

TCGGAATTCCTCAGTGATGGTGATGGTGATGGGCCCCGCGGCCACAGTC 

27 - nucleotide sequence 

TCGCCACCATGGCCGGGCCGGGC 
28 - nucleotide sequence



TCTCAGTGATGGTGATGGTGATGCGATGCACCCCCACACTCTC 



